ENDOGENOUS AND NON-ENDOGENOUS VERSIONS OF
HUMAN G PROTEIN-COUPLED RECEPTORS 

FIELD OF THE INVENTION 

The invention disclosed in this patent document relates to transmembrane
receptors, and more particularly to human G protein-coupled receptors, and
specifically to endogenous human GPCRs with particular emphasis on
non-endogenous versions of the GPCRs that have been altered to establish or enhance
constitutive activity of the receptor. Preferably, the altered GPCRs are used for the
direct identification of candidate compounds as receptor agonists, inverse agonists or
partial agonists having potential applicability as therapeutic agents.



BACKGROUND OF THE INVENTION

Although a number of receptor classes exist in humans, by far the most abundant
and therapeutically relevant is represented by the G protein-coupled receptor (GPCR or
GPCRs) class. It is estimated that there are some 100,000 genes within the human
genome, and of these, approximately 2%, or 2,000 genes, are estimated to code for
GPCRs. Receptors, including GPCRs, for which the endogenous ligand has been
identified are referred to as "known" receptors, while receptors for which the
endogenous ligand has not been identified are referred to as "orphan" receptors. GPCRs
represent an important area for the development of pharmaceutical products: from
approximately 20 of the 100 known GPCRs, approximately 60% of all prescription
pharmaceuticals have been developed.

GPCRs share a common structural motif. All these receptors have seven
sequences of between 22 and 24 hydrophobic amino acids that form seven alpha helices,
each of which spans the membrane (each span is identified by number, i.e.,
transmembrane-1 (TM-1), transmembrane-2 (TM-2), etc.). The transmembrane helices
are joined by strands of amino acids between transmembrane-2 and transmembrane-3,
transmembrane-4 and transmembrane-5, and transmembrane-6 and transmembrane-7 on
the exterior, or "extracellular" side, of the cell membrane (these are referred to as
"extracellular" regions 1, 2 and 3 (EC-1, EC-2 and EC-3), respectively). The
transmembrane helices are also joined by strands of amino acids between
transmembrane-1 and transmembrane-2, transmembrane-3 and transmembrane-4, and
transmembrane-5 and transmembrane-6 on the interior, or "intracellular" side, of the cell
membrane (these are referred to as "intracellular" regions 1, 2 and 3 (IC-1, IC-2 and
IC-3), respectively). The "carboxy" ("C") terminus of the receptor lies in the intracellular
space within the cell, and the "amino" ("N") terminus of the receptor lies in the
extracellular space outside of the cell.
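By way of illustration only, the loop naming convention described above can be sketched in a few lines of Python. The function and variable names (`loop_between`, `LOOPS`) are hypothetical and are not part of this patent document; the sketch simply encodes the stated topology, in which the N terminus is extracellular and the loops alternate between the intracellular and extracellular sides.

```python
# Illustrative model of the seven-transmembrane topology described above.
# All names here are hypothetical and chosen only for this sketch.

def loop_between(tm_a: int, tm_b: int) -> str:
    """Return the loop label (e.g. 'IC-1') joining two adjacent TM helices."""
    if not (1 <= tm_a <= 6 and tm_b == tm_a + 1):
        raise ValueError("helices must be adjacent, TM-1 through TM-7")
    # The N terminus is extracellular, so the chain is inside the cell after
    # TM-1: odd-numbered helices are followed by intracellular loops and
    # even-numbered helices by extracellular loops.
    side = "IC" if tm_a % 2 == 1 else "EC"
    index = (tm_a + 1) // 2
    return f"{side}-{index}"

# Map each adjacent helix pair to its loop label.
LOOPS = {(a, a + 1): loop_between(a, a + 1) for a in range(1, 7)}

if __name__ == "__main__":
    for (a, b), label in LOOPS.items():
        print(f"TM-{a} / TM-{b}: {label}")
```

Under this convention the IC-3 loop, discussed later in connection with G protein coupling, is the strand joining TM-5 and TM-6.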

Generally, when an endogenous ligand binds with the receptor (often referred to
as "activation" of the receptor), there is a change in the conformation of the intracellular
region that allows for coupling between the intracellular region and an intracellular
"G-protein." It has been reported that GPCRs are "promiscuous" with respect to G proteins,
i.e., that a GPCR can interact with more than one G protein. See, Kenakin, T., 43 Life
Sciences 1095 (1988). Although other G proteins exist, currently, Gq, Gs, Gi, Gz and Go
are G proteins that have been identified. Endogenous ligand-activated GPCR coupling
with the G-protein begins a signaling cascade process (referred to as "signal
transduction"). Under normal conditions, signal transduction ultimately results in
cellular activation or cellular inhibition. It is thought that the IC-3 loop as well as the
carboxy terminus of the receptor interact with the G protein.

Under physiological conditions, GPCRs exist in the cell membrane in
equilibrium between two different conformations: an "inactive" state and an "active"
state. A receptor in an inactive state is unable to link to the intracellular signaling
transduction pathway to produce a biological response. Changing the receptor
conformation to the active state allows linkage to the transduction pathway (via the
G-protein) and produces a biological response.

A receptor may be stabilized in an active state by an endogenous ligand or a 
compound such as a drug. Recent discoveries, including but not exclusively limited to 
modifications to the amino acid sequence of the receptor, provide means other than 
endogenous ligands or drugs to promote and stabilize the receptor in the active state 
conformation. These means effectively stabilize the receptor in an active state by 
simulating the effect of an endogenous ligand binding to the receptor. Stabilization by
such ligand-independent means is termed "constitutive receptor activation."

SUMMARY OF THE INVENTION 
Disclosed herein are endogenous and non-endogenous versions of human 
GPCRs and uses thereof. 



BRIEF DESCRIPTION OF THE DRAWINGS

Figure 1 provides an illustration of second messenger IP3 production from
endogenous version RUP12 ("RUP12") as compared with the control ("CMV").

Figure 2 is a graphic representation of the results of a second messenger
cell-based cyclic AMP assay providing comparative results for constitutive signaling of
endogenous RUP13 ("RUP13") and a control vector ("CMV").

Figure 3 is a diagrammatic representation of the signal measured comparing
CMV, endogenous RUP13 ("RUP13 wt") and non-endogenous, constitutively activated
RUP13 ("RUP13(A268K)"), utilizing 8XCRE-Luc reporter plasmid.

Figure 4 is a graphic representation of the results of a [35S]GTPγS assay
providing comparative results for constitutive signaling by RUP13:Gs Fusion Protein
("RUP13-Gs") and a control vector ("CMV").

Figure 5 is a diagrammatic representation of the signal measured comparing
CMV, endogenous RUP14 ("RUP14 wt") and non-endogenous, constitutively activated
RUP14 ("RUP14(L246K)"), utilizing 8XCRE-Luc reporter plasmid.

Figure 6 is a diagrammatic representation of the signal measured comparing
CMV, endogenous RUP15 ("RUP15 wt") and non-endogenous, constitutively activated
RUP15 ("RUP15(A398K)"), utilizing 8XCRE-Luc reporter plasmid.

Figure 7 is a graphic representation of the results of a second messenger
cell-based cyclic AMP assay providing comparative results for constitutive signaling of
endogenous RUP15 ("RUP15 wt"), non-endogenous, constitutively activated version of
RUP15 ("RUP15(A398K)") and a control vector ("CMV").

Figure 8 is a graphic representation of the results of a [35S]GTPγS assay
providing comparative results for constitutive signaling by RUP15:Gs Fusion Protein
("RUP15-Gs") and a control vector ("CMV").

Figure 9 provides an illustration of second messenger IP3 production from
endogenous version RUP17 ("RUP17") as compared with the control ("CMV").

Figure 10 provides an illustration of second messenger IP3 production from
endogenous version RUP21 ("RUP21") as compared with the control ("CMV").

Figure 11 is a diagrammatic representation of the signal measured comparing
CMV, endogenous RUP23 ("RUP23 wt") and non-endogenous, constitutively activated
RUP23 ("RUP23(W275K)"), utilizing 8XCRE-Luc reporter plasmid.

Figure 12 is a graphic representation of results from a primary screen of several
candidate compounds against RUP13; results for "Compound A" are provided in well
A2 and results for "Compound B" are provided in well G9.



DETAILED DESCRIPTION

The scientific literature that has evolved around receptors has adopted a number
of terms to refer to ligands having various effects on receptors. For clarity and
consistency, the following definitions will be used throughout this patent document. To
the extent that these definitions conflict with other definitions for these terms, the
following definitions shall control:

AGONISTS shall mean materials (e.g., ligands, candidate compounds) that
activate the intracellular response when they bind to the receptor, or enhance GTP
binding to membranes.

AMINO ACID ABBREVIATIONS used herein are set out in Table A: 

TABLE A

ALANINE          ALA    A
ARGININE         ARG    R
ASPARAGINE       ASN    N
ASPARTIC ACID    ASP    D
CYSTEINE         CYS    C
GLUTAMIC ACID    GLU    E
GLUTAMINE        GLN    Q
GLYCINE          GLY    G
HISTIDINE        HIS    H
ISOLEUCINE       ILE    I
LEUCINE          LEU    L
LYSINE           LYS    K
METHIONINE       MET    M
PHENYLALANINE    PHE    F
PROLINE          PRO    P
SERINE           SER    S
THREONINE        THR    T
TRYPTOPHAN       TRP    W
TYROSINE         TYR    Y
VALINE           VAL    V



PARTIAL AGONISTS shall mean materials (e.g., ligands, candidate
compounds) that activate the intracellular response when they bind to the receptor to a
lesser degree/extent than do agonists, or enhance GTP binding to membranes to a lesser
degree/extent than do agonists.

ANTAGONIST shall mean materials (e.g., ligands, candidate compounds) that
competitively bind to the receptor at the same site as the agonists but which do not
activate the intracellular response initiated by the active form of the receptor, and can
thereby inhibit the intracellular responses by agonists or partial agonists.
ANTAGONISTS do not diminish the baseline intracellular response in the absence of an
agonist or partial agonist.

CANDIDATE COMPOUND shall mean a molecule (for example, and not
limitation, a chemical compound) that is amenable to a screening technique. Preferably,
the phrase "candidate compound" does not include compounds which were publicly
known to be compounds selected from the group consisting of inverse agonist, agonist or
antagonist to a receptor, as previously determined by an indirect identification process
("indirectly identified compound"); more preferably, not including an indirectly
identified compound which has previously been determined to have therapeutic efficacy
in at least one mammal; and, most preferably, not including an indirectly identified
compound which has previously been determined to have therapeutic utility in humans.

COMPOSITION means a material comprising at least one component; a
"pharmaceutical composition" is an example of a composition.

COMPOUND EFFICACY shall mean a measurement of the ability of a
compound to inhibit or stimulate receptor functionality, as opposed to receptor binding
affinity. Exemplary means of detecting compound efficacy are disclosed in the Example
section of this patent document.

CODON shall mean a grouping of three nucleotides (or equivalents to
nucleotides) which generally comprise a nucleoside (adenosine (A), guanosine (G),
cytidine (C), uridine (U) and thymidine (T)) coupled to a phosphate group and which,
when translated, encodes an amino acid.
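For example, and not limitation, the grouping of a coding sequence into codons can be sketched as follows. The helper names and the two toy sequences are hypothetical and are not taken from this patent document; the partial codon table lists only a handful of standard genetic-code entries, written in the DNA alphabet, that the example needs.

```python
# Minimal sketch: split a coding sequence into codons and translate it.
# Helper names and toy sequences are hypothetical, for illustration only.

PARTIAL_CODON_TABLE = {
    "ATG": "M",  # methionine
    "GCC": "A",  # alanine
    "AAG": "K",  # lysine
    "CTG": "L",  # leucine
    "TGG": "W",  # tryptophan
}

def split_codons(dna: str) -> list[str]:
    """Group a DNA coding sequence into consecutive three-nucleotide codons."""
    dna = dna.upper()
    if len(dna) % 3 != 0:
        raise ValueError("sequence length must be a multiple of three")
    return [dna[i:i + 3] for i in range(0, len(dna), 3)]

def translate(dna: str) -> str:
    """Translate using the partial table; unknown codons raise KeyError."""
    return "".join(PARTIAL_CODON_TABLE[codon] for codon in split_codons(dna))

wild_type = "ATGGCCCTG"  # codons ATG, GCC, CTG
mutant = "ATGAAGCTG"     # the middle codon changed from GCC to AAG

if __name__ == "__main__":
    print(translate(wild_type), translate(mutant))
```

In this toy case a single codon change converts an alanine to a lysine, which is the kind of single amino acid and/or codon change contemplated in the definition of MUTANT below.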

CONSTITUTIVELY ACTIVATED RECEPTOR shall mean a receptor
subject to constitutive receptor activation. A constitutively activated receptor can be
endogenous or non-endogenous.

CONSTITUTIVE RECEPTOR ACTIVATION shall mean stabilization of a
receptor in the active state by means other than binding of the receptor with its
endogenous ligand or a chemical equivalent thereof.

CONTACT or CONTACTING shall mean bringing at least two moieties
together, whether in an in vitro system or an in vivo system.

DIRECTLY IDENTIFYING or DIRECTLY IDENTIFIED, in relationship to
the phrase "candidate compound", shall mean the screening of a candidate compound
against a constitutively activated receptor, preferably a constitutively activated orphan
receptor, and most preferably against a constitutively activated G protein-coupled cell
surface orphan receptor, and assessing the compound efficacy of such compound. This
phrase is, under no circumstances, to be interpreted or understood to be encompassed by
or to encompass the phrase "indirectly identifying" or "indirectly identified."

ENDOGENOUS shall mean a material that a mammal naturally produces.
ENDOGENOUS in reference to, for example and not limitation, the term "receptor,"
shall mean that which is naturally produced by a mammal (for example, and not
limitation, a human) or a virus. By contrast, the term NON-ENDOGENOUS in this
context shall mean that which is not naturally produced by a mammal (for example, and
not limitation, a human) or a virus. For example, and not limitation, a receptor which is
not constitutively active in its endogenous form, but when manipulated becomes
constitutively active, is most preferably referred to herein as a "non-endogenous,
constitutively activated receptor." Both terms can be utilized to describe both "in vivo"
and "in vitro" systems. For example, and not limitation, in a screening approach, the
endogenous or non-endogenous receptor may be in reference to an in vitro screening
system. As a further example and not limitation, where the genome of a mammal has
been manipulated to include a non-endogenous constitutively activated receptor,
screening of a candidate compound by means of an in vivo system is viable.

G PROTEIN COUPLED RECEPTOR FUSION PROTEIN and GPCR
FUSION PROTEIN, in the context of the invention disclosed herein, each mean a
non-endogenous protein comprising an endogenous, constitutively activated GPCR or a
non-endogenous, constitutively activated GPCR fused to at least one G protein, most
preferably the alpha (α) subunit of such G protein (this being the subunit that binds
GTP), with the G protein preferably being of the same type as the G protein that
naturally couples with the endogenous orphan GPCR. For example, and not limitation, in an
endogenous state, if the G protein "Gsα" is the predominant G protein that couples with
the GPCR, a GPCR Fusion Protein based upon the specific GPCR would be a
non-endogenous protein comprising the GPCR fused to Gsα; in some circumstances, as will
be set forth below, a non-predominant G protein can be fused to the GPCR. The G
protein can be fused directly to the C-terminus of the constitutively active GPCR or there
may be spacers between the two.

HOST CELL shall mean a cell capable of having a Plasmid and/or Vector
incorporated therein. In the case of a prokaryotic Host Cell, a Plasmid is typically
replicated as an autonomous molecule as the Host Cell replicates (generally, the Plasmid
is thereafter isolated for introduction into a eukaryotic Host Cell); in the case of a
eukaryotic Host Cell, a Plasmid is integrated into the cellular DNA of the Host Cell such
that when the eukaryotic Host Cell replicates, the Plasmid replicates. Preferably, for the
purposes of the invention disclosed herein, the Host Cell is eukaryotic, more preferably,
mammalian, and most preferably selected from the group consisting of 293, 293T and
COS-7 cells.

INDIRECTLY IDENTIFYING or INDIRECTLY IDENTIFIED means the
traditional approach to the drug discovery process involving identification of an
endogenous ligand specific for an endogenous receptor, screening of candidate
compounds against the receptor for determination of those which interfere and/or
compete with the ligand-receptor interaction, and assessing the efficacy of the compound
for affecting at least one second messenger pathway associated with the activated
receptor.

INHIBIT or INHIBITING, in relationship to the term "response" shall mean
that a response is decreased or prevented in the presence of a compound as opposed to in
the absence of the compound.

INVERSE AGONISTS shall mean materials (e.g., ligand, candidate compound)
which bind to either the endogenous form of the receptor or to the constitutively
activated form of the receptor, and which inhibit the baseline intracellular response
initiated by the active form of the receptor below the normal base level of activity which
is observed in the absence of agonists or partial agonists, or decrease GTP binding to
membranes. Preferably, the baseline intracellular response is inhibited in the presence of
the inverse agonist by at least 30%, more preferably by at least 50%, and most preferably
by at least 75%, as compared with the baseline response in the absence of the inverse
agonist.
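By way of example only, the inhibition thresholds recited in this definition can be applied to a measured response as sketched below. The function names and the numerical readings are hypothetical; the sketch assumes only that the baseline response and the response in the presence of the compound are measured in the same units.

```python
# Sketch of the 30% / 50% / 75% inhibition thresholds from the definition of
# INVERSE AGONISTS. Function names and sample values are hypothetical.

def percent_inhibition(baseline: float, with_compound: float) -> float:
    """Percent decrease of the response relative to the compound-free baseline."""
    if baseline <= 0:
        raise ValueError("baseline response must be positive")
    return 100.0 * (baseline - with_compound) / baseline

def preference_tier(inhibition_pct: float) -> str:
    """Map a percent inhibition onto the tiers recited in the definition."""
    if inhibition_pct >= 75:
        return "most preferred"
    if inhibition_pct >= 50:
        return "more preferred"
    if inhibition_pct >= 30:
        return "preferred"
    return "below preferred threshold"

if __name__ == "__main__":
    # Hypothetical readings in arbitrary units.
    pct = percent_inhibition(baseline=200.0, with_compound=80.0)
    print(pct, preference_tier(pct))
```

A negative value from `percent_inhibition` would indicate that the response rose in the presence of the compound, which is the behavior expected of an agonist rather than an inverse agonist.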

KNOWN RECEPTOR shall mean an endogenous receptor for which the 
endogenous ligand specific for that receptor has been identified. 

LIGAND shall mean an endogenous, naturally occurring molecule specific for 
an endogenous, naturally occurring receptor. 

MUTANT or MUTATION in reference to an endogenous receptor's nucleic
acid and/or amino acid sequence shall mean a specified change or changes to such
endogenous sequences such that a mutated form of an endogenous, non-constitutively
activated receptor evidences constitutive activation of the receptor. In terms of
equivalents to specific sequences, a subsequent mutated form of a human receptor is
considered to be equivalent to a first mutation of the human receptor if (a) the level of
constitutive activation of the subsequent mutated form of a human receptor is
substantially the same as that evidenced by the first mutation of the receptor; and (b) the
percent sequence (amino acid and/or nucleic acid) homology between the subsequent
mutated form of the receptor and the first mutation of the receptor is at least about 80%,
more preferably at least about 90% and most preferably at least 95%. Ideally, and owing
to the fact that the most preferred cassettes disclosed herein for achieving constitutive
activation include a single amino acid and/or codon change between the endogenous
and the non-endogenous forms of the GPCR, the percent sequence homology should be
at least 98%.
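For illustration, and not limitation, the homology criterion in part (b) can be approximated by a position-by-position comparison of two sequences of equal length. This patent document does not specify how percent homology is computed, so the use of simple percent identity here is an assumption, and the function names and toy sequences are hypothetical.

```python
# Sketch: percent identity between two equal-length sequences, used here as
# a stand-in (an assumption) for the "percent sequence homology" criterion.

def percent_identity(seq_a: str, seq_b: str) -> float:
    """Percentage of positions at which two equal-length sequences agree."""
    if not seq_a or len(seq_a) != len(seq_b):
        raise ValueError("sequences must be non-empty and of equal length")
    matches = sum(a == b for a, b in zip(seq_a, seq_b))
    return 100.0 * matches / len(seq_a)

def homology_tier(pct: float) -> str:
    """Map a percentage onto the 80% / 90% / 95% tiers of the definition."""
    if pct >= 95:
        return "most preferred"
    if pct >= 90:
        return "more preferred"
    if pct >= 80:
        return "preferred"
    return "below 80%"

# Toy 100-residue sequences differing by one alanine-to-lysine substitution.
first_form = "A" * 100
second_form = "A" * 49 + "K" + "A" * 50

if __name__ == "__main__":
    pct = percent_identity(first_form, second_form)
    print(pct, homology_tier(pct))
```

With a single substitution, any sequence of 50 residues or more scores at least 98% under this measure, which is consistent with the "at least 98%" figure given for single amino acid changes.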






WO 01/36471 PCT/US00/31509

NON-ORPHAN RECEPTOR shall mean an endogenous naturally occurring 
molecule specific for an endogenous naturally occurring ligand wherein the binding of a 
ligand to a receptor activates an intracellular signaling pathway. 

ORPHAN RECEPTOR shall mean an endogenous receptor for which the
endogenous ligand specific for that receptor has not been identified or is not known.

PHARMACEUTICAL COMPOSITION shall mean a composition
comprising at least one active ingredient, whereby the composition is amenable to
investigation for a specified, efficacious outcome in a mammal (for example, and not
limitation, a human). Those of ordinary skill in the art will understand and appreciate the
techniques appropriate for determining whether an active ingredient has a desired
efficacious outcome based upon the needs of the artisan.

PLASMID shall mean the combination of a Vector and cDNA. Generally, a
Plasmid is introduced into a Host Cell for the purposes of replication and/or expression
of the cDNA as a protein.

SECOND MESSENGER shall mean an intracellular response produced as a
result of receptor activation. A second messenger can include, for example, inositol
triphosphate (IP3), diacylglycerol (DAG), cyclic AMP (cAMP), and cyclic GMP
(cGMP). Second messenger response can be measured for a determination of receptor
activation. In addition, second messenger response can be measured for the direct
identification of candidate compounds, including for example, inverse agonists, agonists,
partial agonists and antagonists.

STIMULATE or STIMULATING, in relationship to the term "response" shall
mean that a response is increased in the presence of a compound as opposed to in the
absence of the compound.

VECTOR in reference to cDNA shall mean a circular DNA capable of
incorporating at least one cDNA and capable of incorporation into a Host Cell.

The order of the following sections is set forth for presentational efficiency and
is not intended, nor should be construed, as a limitation on the disclosure or the claims to
follow.

A. Introduction 

The traditional study of receptors has always proceeded from the a priori
assumption (historically based) that the endogenous ligand must first be identified before
discovery could proceed to find antagonists and other molecules that could affect the
receptor. Even in cases where an antagonist might have been known first, the search
immediately extended to looking for the endogenous ligand. This mode of thinking has
persisted in receptor research even after the discovery of constitutively activated
receptors. What has not been heretofore recognized is that it is the active state of the
receptor that is most useful for discovering agonists, partial agonists, and inverse
agonists of the receptor. For those diseases which result from an overly active receptor
or an under-active receptor, what is desired in a therapeutic drug is a compound which
acts to diminish the active state of a receptor or enhance the activity of the receptor,
respectively, not necessarily a drug which is an antagonist to the endogenous ligand.
This is because a compound that reduces or enhances the activity of the active receptor
state need not bind at the same site as the endogenous ligand. Thus, as taught by a
method of this invention, any search for therapeutic compounds should start by
screening compounds against the ligand-independent active state.

B. Identification of Human GPCRs

The efforts of the Human Genome Project have led to the identification of a
plethora of information regarding nucleic acid sequences located within the human
genome; it has been the case in this endeavor that genetic sequence information has been
made available without an understanding or recognition as to whether or not any
particular genomic sequence does or may contain open-reading frame information that
translates into human proteins. Several methods of identifying nucleic acid sequences within
the human genome are within the purview of those having ordinary skill in the art. For
example, and not limitation, a variety of human GPCRs, disclosed herein, were
discovered by reviewing the GenBank™ database. Table B, below, lists several
endogenous GPCRs that we have discovered, along with other GPCRs that are
homologous to the disclosed GPCR.

TABLE B

Disclosed       Accession    Open Reading    Reference To       Per Cent Homology
Human Orphan    Number       Frame           Homologous         To Designated
GPCRs           Identified   (Base Pairs)    GPCR               GPCR

hRUP8           AL121755     1,152 bp        NPY2R              27%
hRUP9           AC0113375    1,260 bp        GAL2R              22%
hRUP10          AC008745     1,014 bp        C5aR               40%
hRUP11          AC013396     1,272 bp        HM74               36%
hRUP12          AP000808     966 bp          Mas1               34%
hRUP13          AC011780     1,356 bp        Fish GPRX-ORYLA    43%
hRUP14          AL137118     1,041 bp        CysLT1R            35%
hRUP15          AL016468     1,527 bp        RE2                30%
hRUP16          AL136106     1,068 bp        GLR101             37%
hRUP17          AC023078     969 bp          Mas1               37%
hRUP18          AC008547     1,305 bp        Oxytocin           31%
hRUP19          AC026331     1,041 bp        HM74               52%
hRUP20          AL161458     1,011 bp        GPR34              25%
hRUP21          AC026756     1,014 bp        P2Y1R              37%
hRUP22          AC027026     993 bp          RUP17              67%
                                             Mas1               37%
hRUP23          AC007104     1,092 bp        Rat GPR26          31%
hRUP24          AL355388     1,125 bp        SALPR              44%
hRUP25          AC026331     1,092 bp        HM74               95%
hRUP26          AC023040     1,044 bp        Rabbit 5HT1D       27%
hRUP27          AC027643     158,700         MCH                38%



Receptor homology is useful in terms of gaining an appreciation of a role of the
receptors within the human body. As the patent document progresses, we will disclose
techniques for mutating these receptors to establish non-endogenous, constitutively
activated versions of these receptors.

The techniques disclosed herein have also been applied to other human, orphan 
GPCRs known to the art, as will be apparent as the patent document progresses. 

C. Receptor Screening 

Screening candidate compounds against a non-endogenous, constitutively
activated version of the human GPCRs disclosed herein allows for the direct
identification of candidate compounds which act at this cell surface receptor, without
requiring use of the receptor's endogenous ligand. Using routine, and often
commercially available techniques, one can determine areas within the body where the
endogenous version of human GPCRs disclosed herein is expressed and/or
over-expressed. It is also possible using these techniques to determine related
disease/disorder states which are associated with the expression and/or over-expression
of the receptor; such an approach is disclosed in this patent document.

Creation of a mutation that may evidence constitutive activation of the human
GPCRs disclosed herein is based upon the distance from the proline residue which is
presumed to be located within TM6 of the GPCR; this algorithmic technique
is disclosed in co-pending and commonly assigned patent document PCT Application
Number PCT/US99/23938, published as WO 00/22129 on April 20, 2000, which, along
with the other patent documents listed herein, is incorporated herein by reference. The
algorithmic technique is not predicated upon traditional sequence "alignment" but rather
a specified distance from the aforementioned TM6 proline residue (or, of course,
endogenous constitutive substitution for such proline residue). By mutating the amino
acid residue located 16 amino acid residues from this residue (presumably located in the
IC3 region of the receptor) to, most preferably, a lysine residue, such activation may be
obtained. Other amino acid residues may be useful in the mutation at this position to
achieve this objective.
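The positional rule described above can be sketched as follows, by way of example only. The sketch assumes that the 16-residue distance is counted toward the N terminus, since the IC3 region precedes TM6 in the sequence; that direction is an inference and is not stated explicitly here. The function name, the 1-based numbering convention and the toy sequence are likewise hypothetical, and locating the TM6 proline itself is left to the user.

```python
# Sketch of the "16 residues from the TM6 proline" rule. ASSUMPTION: the
# offset is counted toward the N terminus (into IC3). Names are hypothetical.

def proline_rule_mutation(
    sequence: str,
    tm6_proline_pos: int,
    offset: int = 16,
    new_residue: str = "K",
) -> tuple[str, str]:
    """Return (mutation label, mutated sequence); positions are 1-based."""
    if not 1 <= tm6_proline_pos <= len(sequence):
        raise ValueError("proline position is outside the sequence")
    if sequence[tm6_proline_pos - 1] != "P":
        raise ValueError("residue at the given position is not a proline")
    target_pos = tm6_proline_pos - offset
    if target_pos < 1:
        raise ValueError("offset runs past the N terminus")
    old_residue = sequence[target_pos - 1]
    mutated = sequence[:target_pos - 1] + new_residue + sequence[target_pos:]
    return f"{old_residue}{target_pos}{new_residue}", mutated

# Toy 24-residue sequence: alanine at position 4, proline at position 20.
toy_sequence = "GGG" + "A" + "L" * 15 + "P" + "VVVV"

if __name__ == "__main__":
    label, mutated = proline_rule_mutation(toy_sequence, tm6_proline_pos=20)
    print(label)
    print(mutated)
```

The label follows the same old-residue, position, new-residue convention used in the figure descriptions (for example, "A268K"). Passing a different `new_residue` reflects the statement that residues other than lysine may be useful at this position.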


D. Disease/Disorder Identification and/or Selection 

As will be set forth in greater detail below, most preferably inverse agonists and
agonists to the non-endogenous, constitutively activated GPCR can be identified by the
methodologies of this invention. Such inverse agonists and agonists are ideal candidates
as lead compounds in drug discovery programs for treating diseases related to this
receptor. Because of the ability to directly identify inverse agonists to the GPCR,
thereby allowing for the development of pharmaceutical compositions, a search for
diseases and disorders associated with the GPCR is relevant. For example, scanning
both diseased and normal tissue samples for the presence of the GPCR now becomes
more than an academic exercise or one which might be pursued along the path of
identifying an endogenous ligand to the specific GPCR. Tissue scans can be conducted
across a broad range of healthy and diseased tissues. Such tissue scans provide a
preferred first step in associating a specific receptor with a disease and/or disorder.

Preferably, the DNA sequence of the human GPCR is used to make a probe for
(a) dot-blot analysis against tissue-mRNA, and/or (b) RT-PCR identification of the
expression of the receptor in tissue samples. The presence of a receptor in a tissue
source, or a diseased tissue, or the presence of the receptor at elevated concentrations in
diseased tissue compared to a normal tissue, can be preferably utilized to identify a
correlation with a treatment regimen, including but not limited to, a disease associated
with that disease. Receptors can equally well be localized to regions of organs by this
technique. Based on the known functions of the specific tissues to which the receptor is
localized, the putative functional role of the receptor can be deduced.

E. Screening of Candidate Compounds 

1. Generic GPCR screening assay techniques 

When a G protein receptor becomes constitutively active, it binds to a G protein
(e.g., Gq, Gs, Gi, Gz, Go) and stimulates the binding of GTP to the G protein. The G
protein then acts as a GTPase and slowly hydrolyzes the GTP to GDP, whereby the
receptor, under normal conditions, becomes deactivated. However, constitutively
activated receptors continue to exchange GDP to GTP. A non-hydrolyzable analog of
GTP, [35S]GTPγS, can be used to monitor enhanced binding to membranes which
express constitutively activated receptors. It is reported that [35S]GTPγS can be used to
monitor G protein coupling to membranes in the absence and presence of ligand. An
example of this monitoring, among other examples well-known and available to those in
the art, was reported by Traynor and Nahorski in 1995. The preferred use of this assay
system is for initial screening of candidate compounds because the system is generically
applicable to all G protein-coupled receptors regardless of the particular G protein that
interacts with the intracellular domain of the receptor.

2. Specific GPCR screening assay techniques 

Once candidate compounds are identified using the "generic" G protein-coupled
receptor assay (i.e., an assay to select compounds that are agonists, partial agonists, or
inverse agonists), further screening to confirm that the compounds have interacted at the
receptor site is preferred. For example, a compound identified by the "generic" assay
may not bind to the receptor, but may instead merely "uncouple" the G protein from the
intracellular domain.

a. Gs, Gz and Gi.

Gs stimulates the enzyme adenylyl cyclase. Gi (and Gz and Go), on the other
hand, inhibit this enzyme. Adenylyl cyclase catalyzes the conversion of ATP to cAMP;
thus, constitutively activated GPCRs that couple the Gs protein are associated with
increased cellular levels of cAMP. On the other hand, constitutively activated GPCRs
that couple Gi (or Gz, Go) protein are associated with decreased cellular levels of cAMP.
See, generally, "Indirect Mechanisms of Synaptic Transmission," Chpt. 8, From Neuron
To Brain (3rd Ed.) Nichols, J.G. et al. eds. Sinauer Associates, Inc. (1992). Thus, assays
that detect cAMP can be utilized to determine if a candidate compound is, e.g., an
inverse agonist to the receptor (i.e., such a compound would decrease the levels of
cAMP). A variety of approaches known in the art for measuring cAMP can be utilized;
a most preferred approach relies upon the use of anti-cAMP antibodies in an
ELISA-based format. Another type of assay that can be utilized is a whole cell second
messenger reporter system assay. Promoters on genes drive the expression of the
proteins that a particular gene encodes. Cyclic AMP drives gene expression by
promoting the binding of a cAMP-responsive DNA binding protein or transcription
factor (CREB) that then binds to the promoter at specific sites called cAMP response
elements and drives the expression of the gene. Reporter systems can be constructed
which have a promoter containing multiple cAMP response elements before the reporter
gene, e.g., β-galactosidase or luciferase. Thus, a constitutively activated Gs-linked
receptor causes the accumulation of cAMP that then activates the gene and expression of
the reporter protein. The reporter protein such as β-galactosidase or luciferase can then
be detected using standard biochemical assays (Chen et al. 1995).
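For illustration only, the relationship between the coupled G protein and the expected direction of cAMP change can be captured as a small lookup. The names below are hypothetical. The "increased" and "decreased" entries for constitutive activity follow the paragraph above; the inverse agonist direction for Gi-, Gz- and Go-coupled receptors is an inference (reduced receptor activity relieving the inhibition of adenylyl cyclase) rather than something stated in that paragraph.

```python
# Sketch: expected direction of cAMP change by coupled G protein.
# Names are hypothetical. The Gi/Gz/Go inverse agonist direction is inferred.

ADENYLYL_CYCLASE_EFFECT = {
    "Gs": "stimulates",
    "Gi": "inhibits",
    "Gz": "inhibits",
    "Go": "inhibits",
}

def camp_with_constitutive_receptor(g_protein: str) -> str:
    """cAMP level associated with a constitutively activated receptor."""
    effect = ADENYLYL_CYCLASE_EFFECT[g_protein]
    return "increased" if effect == "stimulates" else "decreased"

def camp_shift_from_inverse_agonist(g_protein: str) -> str:
    """Expected cAMP shift when an inverse agonist lowers receptor activity."""
    constitutive = camp_with_constitutive_receptor(g_protein)
    return "decreased" if constitutive == "increased" else "increased"

if __name__ == "__main__":
    for g in ADENYLYL_CYCLASE_EFFECT:
        print(g, camp_with_constitutive_receptor(g),
              camp_shift_from_inverse_agonist(g))
```

A CRE-driven reporter signal would be expected to move in the same direction as cAMP, so the same lookup could in principle be used to anticipate the reporter readout for a Gs-linked receptor.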
b. Go and Gq. 

Gq and Go are associated with activation of the enzyme phospholipase C, which
in turn hydrolyzes the phospholipid PIP2, releasing two intracellular messengers:
diacylglycerol (DAG) and inositol 1,4,5-triphosphate (IP3). Increased accumulation of
IP3 is associated with activation of Gq- and Go-associated receptors. See, generally,
"Indirect Mechanisms of Synaptic Transmission," Chpt. 8, From Neuron To Brain (3rd
Ed.) Nichols, J.G. et al. eds. Sinauer Associates, Inc. (1992). Assays that detect IP3
accumulation can be utilized to determine if a candidate compound is, e.g., an inverse
agonist to a Gq- or Go-associated receptor (i.e., such a compound would decrease the
levels of IP3). Gq-associated receptors can also be examined using an AP1 reporter
assay in that Gq-dependent phospholipase C causes activation of genes containing AP1
elements; thus, activated Gq-associated receptors will evidence an increase in the
expression of such genes, whereby inverse agonists thereto will evidence a decrease in
such expression, and agonists will evidence an increase in such expression.
Commercially available assays for such detection are available.

3. GPCR Fusion Protein 

The use of an endogenous, constitutively activated orphan GPCR or a non-
endogenous, constitutively activated orphan GPCR, for use in screening of candidate
compounds for the direct identification of inverse agonists, agonists and partial agonists
provides an interesting screening challenge in that, by definition, the receptor is active
even in the absence of an endogenous ligand bound thereto. Thus, in order to
differentiate between, e.g., the non-endogenous receptor in the presence of a candidate
compound and the non-endogenous receptor in the absence of that compound, with an
aim of such a differentiation to allow for an understanding as to whether such compound
may be an inverse agonist, agonist, partial agonist or have no effect on such a receptor, it
is preferred that an approach be utilized that can enhance such differentiation. A
preferred approach is the use of a GPCR Fusion Protein.
Generally, once it is determined that a non-endogenous orphan GPCR has been
constitutively activated using the assay techniques set forth above (as well as others), it
is possible to determine the predominant G protein that couples with the endogenous
GPCR. Coupling of the G protein to the GPCR provides a signaling pathway that can be
assessed. Because it is most preferred that screening take place by use of a mammalian
expression system, such a system will be expected to have endogenous G protein therein.
Thus, by definition, in such a system, the non-endogenous, constitutively activated
orphan GPCR will continuously signal. In this regard, it is preferred that this signal be
enhanced such that in the presence of, e.g., an inverse agonist to the receptor, it is more
likely that it will be possible to more readily differentiate, particularly in the context of
screening, between the receptor when it is contacted with the inverse agonist and the
receptor when it is not.

The GPCR Fusion Protein is intended to enhance the efficacy of G protein
coupling with the non-endogenous GPCR. The GPCR Fusion Protein is preferred for
screening with a non-endogenous, constitutively activated GPCR because such an
approach increases the signal that is most preferably utilized in such screening
techniques. This is important in facilitating a significant "signal to noise" ratio; such a
significant ratio is preferred for the screening of candidate compounds as
disclosed herein.

The construction of a construct useful for expression of a GPCR Fusion Protein
is within the purview of those having ordinary skill in the art. Commercially available
expression vectors and systems offer a variety of approaches that can fit the particular
needs of an investigator. The criteria of importance for such a GPCR Fusion Protein
construct are that the endogenous GPCR sequence and the G protein sequence both be in-
frame (preferably, the sequence for the endogenous GPCR is upstream of the G protein
sequence) and that the "stop" codon of the GPCR must be deleted or replaced such that
upon expression of the GPCR, the G protein can also be expressed. The GPCR can be
linked directly to the G protein, or there can be spacer residues between the two
(preferably, no more than about 12, although this number can be readily ascertained by
one of ordinary skill in the art). We have a preference (based upon convenience) for use
of a spacer in that some restriction sites that are not used will, effectively, upon
expression, become a spacer. Most preferably, the G protein that couples to the non-
endogenous GPCR will have been identified prior to the creation of the GPCR Fusion
Protein construct. Because there are only a few G proteins that have been identified, it is
preferred that a construct comprising the sequence of the G protein (i.e., a universal G
protein construct) be available for insertion of an endogenous GPCR sequence therein;
this provides for efficiency in the context of large-scale screening of a variety of
different endogenous GPCRs having different sequences.
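
The construct criteria described above (GPCR upstream of the G protein, both sequences in-frame, GPCR stop codon removed, optional short spacer) can be illustrated with a small check. This is a minimal sketch only, not the cloning procedure of this patent document: the function name build_fusion_orf, the toy sequences in the usage note, and the hard limit of 12 spacer residues (the text says "about 12") are assumptions made for the example.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}


def build_fusion_orf(gpcr_orf: str, g_protein_orf: str, spacer: str = "") -> str:
    """Join a GPCR ORF (upstream) to a G protein ORF in a single reading frame.

    Hypothetical helper for illustration; all inputs are DNA coding sequences.
    """
    gpcr = gpcr_orf.upper()
    gprot = g_protein_orf.upper()
    spacer = spacer.upper()

    # Every piece must be a whole number of codons so the frame is preserved.
    for name, seq in (("GPCR", gpcr), ("G protein", gprot), ("spacer", spacer)):
        if len(seq) % 3 != 0:
            raise ValueError(f"{name} length {len(seq)} is not a multiple of 3")

    # Delete the GPCR stop codon so translation reads through into the G protein.
    if gpcr[-3:] in STOP_CODONS:
        gpcr = gpcr[:-3]

    # Assumed cut-off: the text prefers no more than about 12 spacer residues.
    if len(spacer) // 3 > 12:
        raise ValueError("spacer is longer than about 12 residues")

    fusion = gpcr + spacer + gprot

    # No in-frame stop codon may occur before the final codon of the fusion.
    internal_codons = [fusion[i:i + 3] for i in range(0, len(fusion) - 3, 3)]
    if any(codon in STOP_CODONS for codon in internal_codons):
        raise ValueError("premature in-frame stop codon in fusion")
    return fusion
```

For example, with the toy inputs build_fusion_orf("ATGGCCTGA", "ATGGGCTAA", spacer="GGATCC"), the GPCR stop codon TGA is dropped and the two reading frames are joined through a two-residue spacer.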

As noted above, constitutively activated GPCRs that couple to Gi, Gz and Go are
expected to inhibit the formation of cAMP, making assays based upon these types of
GPCRs challenging (i.e., the cAMP signal decreases upon activation, thus making the
direct identification of, e.g., inverse agonists (which would further decrease this signal)
interesting). As will be disclosed herein, we have ascertained that for these types of
receptors, it is possible to create a GPCR Fusion Protein that is not based upon the
endogenous GPCR's endogenous G protein, in an effort to establish a viable cyclase-
based assay. Thus, for example, an endogenous Gi coupled receptor can be fused to a Gs
protein - we believe that such a fusion construct, upon expression, "drives" or "forces"
the endogenous GPCR to couple with, e.g., Gs rather than the "natural" Gi protein, such
that a cyclase-based assay can be established. Thus, for Gi, Gz and Go coupled
receptors, we prefer that when a GPCR Fusion Protein is used and the assay is based
upon detection of adenylyl cyclase activity, the fusion construct be established with
Gs (or an equivalent G protein that stimulates the formation of the enzyme adenylyl
cyclase).

Equally effective is a G Protein Fusion construct that utilizes a Gq Protein fused
with a Gs, Gi, Gz or Go Protein. A most preferred fusion construct can be accomplished
with a Gq Protein wherein the first six (6) amino acids of the G-protein α-subunit
("Gαq") are deleted and the last five (5) amino acids at the C-terminal end of Gαq are
replaced with the corresponding amino acids of the Gα of the G protein of interest. For
example, a fusion construct can have a Gq (6 amino acid deletion) fused with a Gi
Protein, resulting in a "Gq/Gi Fusion Construct". We believe that this fusion construct
will force the endogenous Gi coupled receptor to couple to its non-endogenous G
protein, Gq, such that the second messenger, for example, inositol triphosphate or
diacylglycerol, can be measured in lieu of cAMP production.
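
The Gq chimera described above (delete the first six residues of the Gq α-subunit, then swap its last five C-terminal residues for those of the G protein of interest) amounts to simple string slicing at the protein level. The sketch below is illustrative only; the function name make_gq_chimera is hypothetical, and the sequences in the usage note are placeholder letters, not real Gα sequences.

```python
def make_gq_chimera(gaq: str, ga_target: str, n_del: int = 6, c_swap: int = 5) -> str:
    """Return a chimeric alpha-subunit protein sequence.

    gaq       : amino acid sequence of the Gq alpha-subunit
    ga_target : amino acid sequence of the alpha-subunit of the G protein of interest
    n_del     : residues removed from the N-terminus of gaq (six in the text)
    c_swap    : C-terminal residues taken from ga_target (five in the text)
    """
    if len(gaq) < n_del + c_swap:
        raise ValueError("Gq alpha sequence is too short for the requested edits")
    if len(ga_target) < c_swap:
        raise ValueError("target alpha sequence is too short")
    # Drop the N-terminal residues and the original C-terminal tail of Gq,
    # then append the C-terminal tail of the G protein of interest.
    core = gaq[n_del:len(gaq) - c_swap]
    return core + ga_target[-c_swap:]
```

With placeholder inputs, make_gq_chimera("AAAAAABBBBBBBBCCCCC", "XXXXXXXXDDDDD") keeps the middle B block of the first sequence and ends with the DDDDD tail of the second.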

4. Co-transfection of a Target Gi Coupled GPCR with a Signal-
Enhancer Gs Coupled GPCR (cAMP Based Assays)

A Gi coupled receptor is known to inhibit adenylyl cyclase, and, therefore,
decrease the level of cAMP production, which can make assessment of cAMP levels
challenging. An effective technique for measuring the decrease in production of cAMP as
an indication of constitutive activation of a receptor that predominantly couples Gi upon
activation can be accomplished by co-transfecting a signal enhancer, e.g., a non-
endogenous, constitutively activated receptor that predominantly couples with Gs upon
activation (e.g., TSHR-A623I, disclosed below), with the Gi linked GPCR. As is
apparent, constitutive activation of a Gs coupled receptor can be determined based upon
an increase in production of cAMP. Constitutive activation of a Gi coupled receptor
leads to a decrease in production of cAMP. Thus, the co-transfection approach is intended
to advantageously exploit these "opposite" effects. For example, co-transfection of a
non-endogenous, constitutively activated Gs coupled receptor (the "signal enhancer")
with the endogenous Gi coupled receptor (the "target receptor") provides a baseline
cAMP signal (i.e., although the Gi coupled receptor will decrease cAMP levels, this
"decrease" will be relative to the substantial increase in cAMP levels established by
the constitutively activated Gs coupled signal enhancer). By then co-transfecting the signal
enhancer with a constitutively activated version of the target receptor, cAMP would be
expected to further decrease (relative to baseline) due to the increased functional activity
of the Gi target (i.e., which decreases cAMP).

Screening of candidate compounds using a cAMP based assay can then be
accomplished, with two provisos: first, relative to the Gi coupled target receptor,
"opposite" effects will result, i.e., an inverse agonist of the Gi coupled target receptor
will increase the measured cAMP signal, while an agonist of the Gi coupled target
receptor will decrease this signal; second, as would be apparent, candidate compounds
that are directly identified using this approach should be assessed independently to
ensure that these do not target the signal enhancing receptor (this can be done prior to or
after screening against the co-transfected receptors).
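
The two provisos above can be expressed as a short decision rule for reading out a co-transfection screen. The sketch is an illustration under stated assumptions, not the analysis used in this patent document: the function name, the use of a fractional change in cAMP, and the 20% threshold are all hypothetical choices.

```python
def classify_gi_target_compound(
    baseline_camp: float,
    treated_camp: float,
    enhancer_only_baseline: float,
    enhancer_only_treated: float,
    threshold: float = 0.2,  # assumed cut-off for a meaningful fractional change
) -> str:
    """Interpret a cAMP readout from cells co-expressing the Gs signal
    enhancer and the Gi coupled target receptor."""

    def frac_change(before: float, after: float) -> float:
        if before <= 0:
            raise ValueError("baseline cAMP must be positive")
        return (after - before) / before

    # Second proviso: the compound must not act on the signal enhancer itself,
    # checked here in cells expressing the enhancer alone.
    if abs(frac_change(enhancer_only_baseline, enhancer_only_treated)) > threshold:
        return "acts on signal enhancer; result not interpretable"

    # First proviso: effects are "opposite" for a Gi coupled target.
    change = frac_change(baseline_camp, treated_camp)
    if change > threshold:
        return "candidate inverse agonist of Gi target"
    if change < -threshold:
        return "candidate agonist of Gi target"
    return "no effect detected"
```

A compound that raises cAMP in the co-transfected cells while leaving enhancer-only cells unchanged would be reported as a candidate inverse agonist of the Gi target.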

F. Medicinal Chemistry 

Generally, but not always, direct identification of candidate compounds is
preferably conducted in conjunction with compounds generated via combinatorial
chemistry techniques, whereby thousands of compounds are randomly prepared for
such analysis. Generally, the results of such screening will be compounds having
unique core structures; thereafter, these compounds are preferably subjected to
additional chemical modification around a preferred core structure(s) to further
enhance the medicinal properties thereof. Such techniques are known to those in the
art and will not be addressed in detail in this patent document.

G. Pharmaceutical Compositions

Candidate compounds selected for further development can be formulated into
pharmaceutical compositions using techniques well known to those in the art. Suitable
pharmaceutically-acceptable carriers are available to those in the art; for example, see
Remington's Pharmaceutical Sciences, 16th Edition, 1980, Mack Publishing Co., (Osol
et al., eds.).

H. Other Utility 

Although a preferred use of the non-endogenous versions of the human GPCRs
disclosed herein may be for the direct identification of candidate compounds as inverse
agonists, agonists or partial agonists (preferably for use as pharmaceutical agents), these
versions of human GPCRs can also be utilized in research settings. For example, in vitro
and in vivo systems incorporating GPCRs can be utilized to further elucidate and
understand the roles these receptors play in the human condition, both normal and
diseased, as well as understanding the role of constitutive activation as it applies to
understanding the signaling cascade. The value in non-endogenous human GPCRs is
that their utility as a research tool is enhanced in that, because of their unique features,
non-endogenous human GPCRs can be used to understand the role of these receptors in
the human body before the endogenous ligand therefor is identified. Other uses of the
disclosed receptors will become apparent to those in the art based upon, inter alia, a
review of this patent document.
EXAMPLES

The following examples are presented for purposes of elucidation, and not
limitation, of the present invention. While specific nucleic acid and amino acid
sequences are disclosed herein, those of ordinary skill in the art are credited with the
ability to make minor modifications to these sequences while achieving the same or
substantially similar results reported below. The traditional approach to application or
understanding of sequence cassettes from one sequence to another (e.g., from rat receptor
to human receptor or from human receptor A to human receptor B) is generally
predicated upon sequence alignment techniques whereby the sequences are aligned in an
effort to determine areas of commonality. The mutational approach disclosed herein
does not rely upon this approach but is instead based upon an algorithmic approach and a
positional distance from a conserved proline residue located within the TM6 region of
human GPCRs. Once this approach is secured, those in the art are credited with the
ability to make minor modifications thereto to achieve substantially the same results (i.e.,
constitutive activation) disclosed herein. Such modified approaches are considered
within the purview of this disclosure.
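
The idea of locating a mutation site by its positional distance from the conserved TM6 proline, rather than by sequence alignment, can be sketched as an index calculation. This is a hedged illustration only: the passage above does not state the distance or its direction, so the offset is left as a required argument, the proline position is assumed to be supplied by the user, and the function name and toy sequence are hypothetical.

```python
def residue_at_offset(protein: str, proline_pos: int, offset: int) -> tuple[int, str]:
    """Return the 1-based position and identity of the residue located
    `offset` residues from a given proline.

    proline_pos : 1-based position of the conserved TM6 proline (user supplied)
    offset      : signed distance; negative values point toward the N-terminus
    """
    if not 1 <= proline_pos <= len(protein):
        raise ValueError("proline position is outside the sequence")
    if protein[proline_pos - 1] != "P":
        raise ValueError(f"residue {proline_pos} is not a proline")
    target = proline_pos + offset
    if not 1 <= target <= len(protein):
        raise ValueError("offset points outside the sequence")
    return target, protein[target - 1]
```

For the toy sequence "ACDEFGHIKLMNPQRSTVWY", where proline is residue 13, an offset of -4 points at residue 9 (K). Because the rule depends only on a counted position, it can be applied to receptors that align poorly with one another.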

Example 1 

Endogenous Human GPCRs

1. Identification of Human GPCRs

The disclosed endogenous human GPCRs were identified based upon a review
of the GenBank™ database information. While searching the database, the following
cDNA clones were identified as evidenced below (Table C).
TABLE C

Disclosed Human   Accession    Complete DNA   Open Reading   Nucleic Acid   Amino Acid
Orphan GPCRs      Number       Sequence       Frame          SEQ.ID.NO.     SEQ.ID.NO.
                  Identified   (Base Pairs)   (Base Pairs)

hRUP8             AL121755     147,566bp      1,152bp        1              2
hRUP9             AC011375     143,181bp      1,260bp        3              4
hRUP10            AC008745     94,194bp       1,014bp        5              6
hRUP11            AC013396     155,086bp      1,272bp        7              8
hRUP12            AP000808     177,764bp      966bp          9              10
hRUP13            AC011780     167,819bp      1,356bp        11             12
hRUP14            AL137118     168,297bp      1,041bp        13             14
hRUP15            AL016468     138,828bp      1,527bp        15             16
hRUP16            AL136106     208,042bp      1,068bp        17             18
hRUP17            AC023078     161,735bp      969bp          19             20
hRUP18            AC008547     117,304bp      1,305bp        21             22
hRUP19            AC026331     145,183bp      1,041bp        23             24
hRUP20            AL161458     163,511bp      1,011bp        25             26
hRUP21            AC026756     156,534bp      1,014bp        27             28
hRUP22            AC027026     151,811bp      993bp          29             30
hRUP23            AC007104     200,000bp      1,092bp        31             32
hRUP24            AL355388     190,538bp      1,125bp        33             34
hRUP25            AC026331     145,183bp      1,092bp        35             36
hRUP26            AC023040     178,508bp      1,044bp        37             38
hRUP27            AC027643     158,700bp      1,020bp        39             40
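
As a quick consistency check on Table C, each open reading frame length should be a whole number of codons, and the corresponding protein length follows by division by three. The sketch below uses the ORF lengths as read from Table C; the assumption that each listed ORF length includes the stop codon is ours and is not stated in the table, and the function name is hypothetical.

```python
# Open reading frame lengths in base pairs, as read from Table C.
ORF_BP = {
    "hRUP8": 1152, "hRUP9": 1260, "hRUP10": 1014, "hRUP11": 1272,
    "hRUP12": 966, "hRUP13": 1356, "hRUP14": 1041, "hRUP15": 1527,
    "hRUP16": 1068, "hRUP17": 969, "hRUP18": 1305, "hRUP19": 1041,
    "hRUP20": 1011, "hRUP21": 1014, "hRUP22": 993, "hRUP23": 1092,
    "hRUP24": 1125, "hRUP25": 1092, "hRUP26": 1044, "hRUP27": 1020,
}


def orf_to_residues(orf_bp: int, includes_stop: bool = True) -> int:
    """Convert an ORF length in base pairs to a number of amino acid residues."""
    if orf_bp % 3 != 0:
        raise ValueError(f"{orf_bp} bp is not a whole number of codons")
    codons = orf_bp // 3
    # Assumption: the stop codon is counted in the ORF length but encodes no residue.
    return codons - 1 if includes_stop else codons


if __name__ == "__main__":
    for name, bp in ORF_BP.items():
        print(f"{name}: {bp} bp -> {orf_to_residues(bp)} residues")
```

Under that assumption, the 1,152 bp ORF of hRUP8 corresponds to 384 codons, i.e. 383 residues plus a stop codon.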



2. Full Length Cloning

a. hRUP8 (Seq. Id. Nos. 1 & 2)
The disclosed human RUP8 was identified based upon the use of EST database
(dbEST) information. While searching the dbEST, a cDNA clone with accession number
AL121755 was identified to encode a novel GPCR. The following PCR primers were
used for RT-PCR with human testis Marathon-Ready cDNA (Clontech) as templates:
5'-CTTGCAGACATCACCATGGCAGCC-3' (SEQ.ID.NO.:41; sense) and
5'-GTGATGCTCTGAGTACTGGACTGG-3' (SEQ.ID.NO.:42; antisense).
PCR was performed using Advantage cDNA polymerase (Clontech; manufacturer's
instructions were followed) in a 50ul reaction by the following cycles: 94°C for 30 sec;
94°C for 10 sec; 65°C for 20 sec; 72°C for 1.5 min; and 72°C for 7 min. Cycles 2
through 4 were repeated 35 times.
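
The cycling program above (one initial step, three steps repeated 35 times, one final extension) can be written down as data to estimate the programmed hold time. This is a minimal sketch: the function name is hypothetical, and the calculation ignores temperature ramp times between steps, so an actual run will take longer.

```python
def total_hold_seconds(initial_s: float, cycle_steps_s: list[float],
                       n_cycles: int, final_s: float) -> float:
    """Sum the programmed hold times of a PCR program (ramp times ignored)."""
    return initial_s + n_cycles * sum(cycle_steps_s) + final_s


if __name__ == "__main__":
    # hRUP8 program as described: 94°C 30 sec; 35 x (94°C 10 sec, 65°C 20 sec,
    # 72°C 1.5 min); 72°C 7 min.
    seconds = total_hold_seconds(30, [10, 20, 90], 35, 7 * 60)
    print(f"{seconds} s, about {seconds / 60:.1f} min of programmed hold time")
```

The same helper can be reused for the other cloning programs in this Example by substituting their step durations and repeat counts.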

A 1.2kb PCR fragment was isolated and cloned into the pCRII-TOPO vector
(Invitrogen) and sequenced using the ABI Big Dye Terminator kit (P.E. Biosystem).
See, SEQ.ID.NO.:1. The putative amino acid sequence for RUP8 is set forth in
SEQ.ID.NO.:2.

b. hRUP9 (Seq. Id. Nos. 3 & 4)

The disclosed human RUP9 was identified based upon the use of GenBank
database information. While searching the database, a cDNA clone with Accession
Number AC011375 was identified as a human genomic sequence from chromosome
5. The full length RUP9 was cloned by PCR using primers:
5'-GAAGCTGTGAAGAGTGATGC-3' (SEQ.ID.NO.:43; sense),
5'-GTCAGCAATATTGATAAGCAGCAG-3' (SEQ.ID.NO.:44; antisense)
and human genomic DNA (Promega) as a template. Taq Plus Precision polymerase
(Stratagene) was used for the amplification in a 100ul reaction with 5% DMSO by the
following cycle with step 2 to step 4 repeated 35 times: 94°C for 1 minute; 94°C for
30 seconds; 56°C for 30 seconds; 72°C for 2 minutes; 72°C for 5 minutes.

A 1.3 Kb PCR fragment was isolated and cloned into the pCRII-TOPO vector
(Invitrogen) from 1% agarose gel and completely sequenced using the ABI Big Dye
Terminator kit (P.E. Biosystem). See, SEQ.ID.NO.:3. The putative amino acid
sequence for RUP9 is set forth in SEQ.ID.NO.:4. The sequence of RUP9 clones isolated
from human genomic DNA matched with the sequence obtained from the database.
c. hRUP10 (Seq. Id. Nos. 5 & 6)
The disclosed human RUP10 was identified based upon the use of GenBank
database information. While searching the database, a cDNA clone with accession
number AC008754 was identified as a human genomic sequence from chromosome
19. The full length RUP10 was cloned by RT-PCR using primers:
5'-CCATGGGGAACGATTCTGTCAGCTACG-3' (SEQ.ID.NO.:45; sense) and
5'-GCl ATG^Cl G/V^GCCAGTCTTGTG-3' (SEQ.ID.NO.:46; antisense)
and human leukocyte Marathon-Ready cDNA (Clontech) as a template. Advantage
cDNA polymerase (Clontech) was used for the amplification in a 50ul reaction by the
following cycle with step 2 to step 4 repeated 35 times: 94°C for 30 seconds; 94°C
for 10 seconds; 62°C for 20 seconds; 72°C for 1.5 minutes; 72°C for 7 minutes. A 1.0
Kb PCR fragment was isolated and cloned into the pCRII-TOPO vector (Invitrogen)
and completely sequenced using the ABI Big Dye Terminator kit (P.E. Biosystem).
The nucleic acid sequence of the novel human receptor RUP10 is set forth in
SEQ.ID.NO.:5 and the putative amino acid sequence thereof is set forth in
SEQ.ID.NO.:6.

d. hRUP11 (Seq. Id. Nos. 7 & 8)

The disclosed human RUP11 was identified based upon the use of GenBank
database information. While searching the database, a cDNA clone with accession
number AC013396 was identified as a human genomic sequence from chromosome 2.
The full length RUP11 was cloned by PCR using primers:
5'-CCAGGATGTTGTGTCACCGTGGTGGC-3' (SEQ.ID.NO.:47; sense),
5'-CACAGCGCTGCAGCCCTGCAGCTGGC-3' (SEQ.ID.NO.:48; antisense)
and human genomic DNA (Clontech) as a template. TaqPlus Precision DNA
polymerase (Stratagene) was used for the amplification in a 50ul reaction by the
following cycle with step 2 to step 4 repeated 35 times: 94°C for 3 minutes; 94°C for 20
seconds; 6TC for 20 seconds; 72°C for 1.5 minutes; 72°C for 7 minutes. A 1.3 Kb PCR
fragment was isolated and cloned into the pCRII-TOPO vector (Invitrogen) and
completely sequenced using the ABI Big Dye Terminator kit (P.E. Biosystem). The
nucleic acid sequence of the novel human receptor RUP11 is set forth in SEQ.ID.NO.:7
and the putative amino acid sequence thereof is set forth in SEQ.ID.NO.:8.
e. hRUP12 (Seq. Id. Nos. 9 & 10)
The disclosed human RUP12 was identified based upon the use of GenBank
database. While searching the database, a cDNA clone with accession number
AP000808 was identified to encode a new GPCR, having significant homology with rat
RTA and human mas1 oncogene GPCRs. The full length RUP12 was cloned by PCR
using primers:
5'-CTTCCTCTCGTAGGGATGAACCAGAC-3' (SEQ.ID.NO.:49; sense)
5'-CTCGCACAGGTGGGAAGCACCTGTGG-3' (SEQ.ID.NO.:50; antisense)
and human genomic DNA (Clontech) as template. TaqPlus Precision DNA polymerase
(Stratagene) was used for the amplification by the following cycle with step 2 to step 4
repeated 35 times: 94°C for 3 min; 94°C for 20 sec; 65°C for 20 sec; 72°C for 2 min and
72°C for 7 min. A 1.0kb PCR fragment was isolated and cloned into the pCRII-TOPO
vector (Invitrogen) and completely sequenced using the ABI Big Dye Terminator kit
(P.E. Biosystem) (see, SEQ.ID.NO.:9 for nucleic acid sequence and SEQ.ID.NO.:10 for
deduced amino acid sequence).

f. hRUP13 (Seq. Id. Nos. 11 & 12)

The disclosed human RUP13 was identified based upon the use of
GenBank database. While searching the database, a cDNA clone with accession number
AC011780 was identified to encode a new GPCR, having significant homology with
GPCR fish GPRX-ORYLA. The full length RUP13 was cloned by PCR using primers:
5'-GCGTGTGACAGGAGGTACCCTGG-3' (SEQ.ID.NO.:51; sense)
5'-CATATCCCTCCGAGTGTCCAGCGGC-3' (SEQ.ID.NO.:52; antisense)
and human genomic DNA (Clontech) as template. TaqPlus Precision DNA polymerase
(Stratagene) was used for the amplification by the following cycle with step 2 to step 4
repeated 35 times: 94°C for 3 min; 94°C for 20 sec; 65°C for 20 sec; 72°C for 2 min and
72°C for 7 min. A 1.35kb PCR fragment was isolated and cloned into the pCRII-TOPO
vector (Invitrogen) and completely sequenced using the ABI Big Dye Terminator kit
(P.E. Biosystem) (see, SEQ.ID.NO.:11 for nucleic acid sequence and SEQ.ID.NO.:12
for deduced amino acid sequence).

g. hRUP14 (Seq. Id. Nos. 13 & 14)
The disclosed human RUP14 was identified based upon the use of GenBank
database information. While searching the database, a cDNA clone with Accession
Number AL137118 was identified as a human genomic sequence from chromosome
13. The full length RUP14 was cloned by PCR using primers:
5'-GCATGGAGAGAAAATTTATGTCCTTGCAACC-3' (SEQ.ID.NO.:53; sense)
5'-CAAGAACAGGTCTCATGTAAGAGCTCC-3' (SEQ.ID.NO.:54; antisense)
and human genomic DNA (Promega) as a template. Taq Plus Precision polymerase
(Stratagene) and 5% DMSO were used for the amplification by the following cycle
with step 2 and step 3 repeated 35 times: 94°C for 3 minutes; 94°C for 20 seconds;
58°C for 2 minutes; 72°C for 10 minutes.

A 1.1 Kb PCR fragment was isolated and cloned into the pCRII-TOPO vector
(Invitrogen) and completely sequenced using the ABI Big Dye Terminator kit (P.E.
Biosystem) (see, SEQ.ID.NO.:13 for nucleic acid sequence and SEQ.ID.NO.:14 for
deduced amino acid sequence). The sequence of RUP14 clones isolated from human
genomic DNA matched with the sequence obtained from the database.

h. hRUP15 (Seq. Id. Nos. 15 & 16)

The disclosed human RUP15 was identified based upon the use of GenBank
database information. While searching the database, a cDNA clone with Accession
Number ACOlMoS was identified as a human genomic sequence. The full length
RUP15 was cloned by PCR using primers:

5'-GCTGTTGCX^ATGACGTCCACCTGCAC-3' (SEQ.ID.NO.:55; sense)
5'-GGACAGTTCAAGGTTTGCCTTAGAAC-3' (SEQ.ID.NO.:56; antisense)

and human genomic DNA (Promega) as a template. Taq Plus Precision polymerase
(Stratagene) was used for the amplification by the following cycle with step 2 to 4
repeated 35 times: 94°C for 3 minutes; 94°C for 20 seconds; 65°C for 20 seconds;
72°C for 2 minutes and 72°C for 7 minutes.

A 1.5 Kb PCR fragment was isolated and cloned into the pCRII-TOPO vector
(Invitrogen) and completely sequenced using the ABI Big Dye Terminator kit (P.E.
Biosystem). See, SEQ.ID.NO.:15 for nucleic acid sequence and SEQ.ID.NO.:16 for
deduced amino acid sequence. The sequence of RUP15 clones isolated from human
genomic DNA matched with the sequence obtained from the database.

i. hRUP16 (Seq. Id. Nos. 17 & 18)
The disclosed human RUP16 was identified based upon the use of GenBank
database information. While searching the database, a cDNA clone with Accession
Number AL136106 was identified as a human genomic sequence from chromosome
13. The full length RUP16 was cloned by PCR using primers:
5'-CTTTCGATACTGCTCGTATGCTC-3' (SEQ.ID.NO.:57; sense, 5' of initiation codon),
5'-GTAGTCCACTGAAAGTCCAGTGATCC-3' (SEQ.ID.NO.:58; antisense, 3' of stop codon)
and human skeletal muscle Marathon-Ready cDNA (Clontech) as template. Advantage
cDNA polymerase (Clontech) was used for the amplification in a 50ul reaction by the
following cycle with step 2 to 4 repeated 35 times: 94°C for 30 seconds; 94°C for 5
seconds; 69°C for 15 seconds; 72°C for 1 minute and 72°C for 5 minutes.

A 1.1 Kb PCR fragment was isolated and cloned into the pCRII-TOPO vector
(Invitrogen) and completely sequenced using the T7 sequenase kit (Amersham). See,
SEQ.ID.NO.:17 for nucleic acid sequence and SEQ.ID.NO.:18 for deduced amino acid
sequence. The sequence of RUP16 clones matched with four unordered segments of
AL136106, indicating that the RUP16 cDNA is composed of 4 exons.
j. hRUP17 (Seq. Id. Nos. 19 & 20)
The disclosed human RUP17 was identified based upon the use of GenBank
database information. While searching the database, a cDNA clone with Accession
Number AC023078 was identified as a human genomic sequence from chromosome
11. The full length RUP17 was cloned by PCR using primers:
5'-TTTGTGAGCATGGATCCAACCATCTC-3' (SEQ.ID.NO.:59; sense, containing initiation
codon)
5'-CTGTCTGACAGGGGAGAGGCTCTTC-3' (SEQ.ID.NO.:60; antisense, 3' of stop codon)
and human genomic DNA (Promega) as template. Advantage cDNA polymerase mix
(Clontech) was used for the amplification in a 100ul reaction with 5% DMSO by the
following cycle with step 2 to 4 repeated 30 times: 94°C for 1 min; 94°C for 15 sec;
6TC for 20 sec; 72°C for 1 min and 30 sec; and 72°C for 5 min.

A 970bp PCR fragment was isolated from 1% agarose gel and cloned into the
pCRII-TOPO vector (Invitrogen) and completely sequenced using the ABI Big Dye
Terminator Kit (P.E. Biosystem). See, SEQ.ID.NO.:19 for nucleic acid sequence and
SEQ.ID.NO.:20 for deduced amino acid sequence.

k. hRUP18 (Seq. Id. Nos. 21 & 22)
The disclosed human RUP18 was identified based upon the use of GenBank
database information. While searching the database, a cDNA clone with Accession
Number AC008547 was identified as a human genomic sequence from chromosome
5. The full length RUP18 was cloned by PCR using primers:
5'-GGAACTCGTATAGACCCAGCGTCGCTCC-3' (SEQ.ID.NO.:61; sense, 5' of the
initiation codon),
5'-GGAGGTTGCGCCTTAGCGACAGATGACC-3' (SEQ.ID.NO.:62; antisense, 3' of stop
codon)
and human genomic DNA (Promega) as template. TaqPlus Precision DNA
polymerase (Stratagene) was used for the amplification in a 100ul reaction with 5%
DMSO by the following cycle with step 2 to 4 repeated 35 times: 95°C for 5 min;
95°C for 30 sec; 65°C for 30 sec; 72°C for 2 min; and 72°C for 5 min.

A 1.3kb PCR fragment was isolated from 1% agarose gel and cloned into the
pCRII-TOPO vector (Invitrogen) and completely sequenced using the ABI Big Dye
Terminator Kit (P.E. Biosystem). See, SEQ.ID.NO.:21 for nucleic acid sequence and
SEQ.ID.NO.:22 for deduced amino acid sequence.
l. hRUP19 (Seq. Id. Nos. 23 & 24)
The disclosed human RUP19 was identified based upon the use of GenBank
database information. While searching the database, a cDNA clone with Accession
Number AC026331 was identified as a human genomic sequence from chromosome
12. The full length RUP19 was cloned by PCR using primers:
5'-CTGCACCCGGACACTTGCTCTG-3' (SEQ.ID.NO.:63; sense, 5' of initiation codon),
5'-GTCTGCTrGT TCA GTGCCACTCAAC-3' (SEQ.ID.NO.:64; antisense, containing the stop
codon)
and human genomic DNA (Promega) as template. TaqPlus Precision DNA
polymerase (Stratagene) was used for the amplification with 5% DMSO by the
following cycle with step 2 to 4 repeated 35 times: 94°C for 1 min; 94°C for 15 sec;
70°C for 20 sec; 72°C for 1 min and 30 sec; and 72°C for 5 min.

A 1.1kb PCR fragment was isolated from 1% agarose gel and cloned into the
pCRII-TOPO vector (Invitrogen) and completely sequenced using the ABI Big Dye
Terminator Kit (P.E. Biosystem). See, SEQ.ID.NO.:23 for nucleic acid sequence and
SEQ.ID.NO.:24 for deduced amino acid sequence.

m. hRUP20 (Seq. Id. Nos. 25 & 26)
The disclosed human RUP20 was identified based upon the use of GenBank
database information. While searching the database, a cDNA clone with Accession
Number AL161458 was identified as a human genomic sequence from chromosome
1. The full length RUP20 was cloned by PCR using primers:
5'-TATCTGCAATTCTATTCTAGCTCCTG-3' (SEQ.ID.NO.:65; sense, 5' of initiation codon),
5'-TGTCCCTAATAAAGTCACATGAATGC-3' (SEQ.ID.NO.:66; antisense, 3' of stop codon)
and human genomic DNA (Promega) as template. Advantage cDNA polymerase mix
(Clontech) was used for the amplification with 5% DMSO by the following cycle with
step 2 to 4 repeated 35 times: 94°C for 1 min; 94°C for 15 sec; 60°C for 20 sec; 72°C
for 1 min and 30 sec; and 72°C for 5 min.

A 1.0 kb PCR fragment was isolated from 1% agarose gel and cloned into the
pCRII-TOPO vector (Invitrogen) and completely sequenced using the ABI Big Dye
Terminator Kit (P.E. Biosystem). See, SEQ.ID.NO.:25 for nucleic acid sequence and
SEQ.ID.NO.:26 for deduced amino acid sequence.

n. hRUP21 (Seq. Id. Nos. 27 & 28)
The disclosed human RUP21 was identified based upon the use of GenBank
database information. While searching the database, a cDNA clone with Accession
Number AC026756 was identified as a human genomic sequence from chromosome
13. The full length RUP21 was cloned by PCR using primers:
5'-GG.AGACAACCATGAATGAGCCAC-3' (SEQ.ID.NO.:67; sense)
5'-TATTTCAAGGGTTGTTTGAGTAAC-3' (SEQ.ID.NO.:68; antisense)
and human genomic DNA (Promega) as template. Taq Plus Precision polymerase
(Stratagene) was used for the amplification in a 100ul reaction with 5% DMSO by the
following cycle with step 2 to 4 repeated 30 times: 94°C for 1 min; 94°C for 15 sec;
55°C for 20 sec; 72°C for 1 min and 30 sec; and 72°C for 5 min.

A 1,014 bp PCR fragment was isolated from 1% agarose gel and cloned into the
pCRII-TOPO vector (Invitrogen) and completely sequenced using the ABI Big Dye
Terminator Kit (P.E. Biosystem). See, SEQ.ID.NO.:27 for nucleic acid sequence and
SEQ.ID.NO.:28 for deduced amino acid sequence.

o. hRUP22 (Seq. Id. Nos. 29 & 30)
The disclosed human RUP22 was identified based upon the use of GenBank
database information. While searching the database, a cDNA clone with Accession
Number AC027026 was identified as a human genomic sequence from chromosome
11. The full length RUP22 was cloned by PCR using primers:
5'-GGCACCAGTGGAGGnTTCTGAGCATG-3' (SEQ.ID.NO.:69; sense, containing
initiation codon)
5'-CTGATGGAAGTAGAGGCrGTCCATCTC-3' (SEQ.ID.NO.:70; antisense, 3' of stop
codon)
and human genomic DNA (Promega) as template. TaqPlus Precision DNA polymerase
(Stratagene) was used for the amplification in a 100ul reaction with 5% DMSO by the
following cycle with step 2 to 4 repeated 30 times: 94°C for 1 minute; 94°C for 15
seconds; 55°C for 20 seconds; 72°C for 1.5 minutes; 72°C for 5 minutes.

A 970bp PCR fragment was isolated from 1% agarose gel and cloned into the
pCRII-TOPO vector (Invitrogen) and completely sequenced using the ABI Big Dye
Terminator Kit (P.E. Biosystem). See, SEQ.ID.NO.:29 for nucleic acid sequence and
SEQ.ID.NO.:30 for deduced amino acid sequence.
1 5 p. hRUP23 (Seq. Id. Nos. 31 & 32) 

The disclosed human RUP23 was identified based upon the use of GenBank
database information. While searching the database, a cDNA clone with Accession
Number AC007104 was identified as a human genomic sequence from chromosome
4. The full length RUP23 was cloned by PCR using primers:
5'-CCTGGCGAGCCGCTAGCGCCATG-3' (SEQ.ID.NO.:71; sense, ATG as the initiation
codon),
5'-ATGAGCCCTGCCAGGCCCTCAGT-3' (SEQ.ID.NO.:72; antisense, TCA as the stop
codon)
and human placenta Marathon-Ready cDNA (Clontech) as template. Advantage cDNA
polymerase (Clontech) was used for the amplification in a 50µl reaction by the following
cycle with steps 2 to 4 repeated 35 times: 95°C for 30 sec; 95°C for 15 sec; 66°C for 20
sec; 72°C for 1 min and 20 sec; and 72°C for 5 min.

A 1.0 kb PCR fragment was isolated and cloned into the pCRII-TOPO vector
(Invitrogen) and completely sequenced using the ABI Big Dye Terminator Kit (P.E.
Biosystem). See, SEQ.ID.NO.:31 for nucleic acid sequence and SEQ.ID.NO.:32 for
deduced amino acid sequence.

q. hRUP24 (Seq. Id. Nos. 33 & 34)
The disclosed human RUP25 was identified based upon the use of GenBank
database information. While searching the database, a cDNA clone with Accession
Number AC026331 was identified as a human genomic sequence from chromosome
12. The full length RUP25 was cloned by PCR using primers:
5'-GCTGGAGCATTCACTAGGCGAG-3' (SEQ.ID.NO.:73; sense, 5' of initiation codon),
5'-AGATCCTGGTTCTTGGTGACAATG-3' (SEQ.ID.NO.:74; antisense, 3' of stop codon)
and human genomic DNA (Promega) as template. Advantage cDNA polymerase mix
(Clontech) was used for the amplification with 5% DMSO by the following cycle with
steps 2 to 4 repeated 35 times: 94°C for 1 minute; 94°C for 15 seconds; 56°C for 20
seconds; 72°C for 1 minute 30 seconds; and 72°C for 5 minutes.

A 1.2kb PCR fragment was isolated from 1% agarose gel and cloned into the
pCRII-TOPO vector (Invitrogen) and completely sequenced using the ABI Big Dye
Terminator Kit (P.E. Biosystem). See, SEQ.ID.NO.:33 for nucleic acid sequence and
SEQ.ID.NO.:34 for deduced amino acid sequence.

r. hRUP25 (Seq. Id. Nos. 35 & 36) 
The disclosed human RUP25 was identified based upon the use of GenBank
database information. While searching the database, a cDNA clone with Accession
Number AC026331 was identified as a human genomic sequence from chromosome
12. The full length RUP25 was cloned by PCR using primers:
5'-GCTGGAGCATTCACTAGGCGAG-3' (SEQ.ID.NO.:75; sense, 5' of initiation codon),
5'-AGATCCTGGTTCTTGGTGACAATG-3' (SEQ.ID.NO.:76; antisense, 3' of stop codon)
and human genomic DNA (Promega) as template. Advantage cDNA polymerase mix
(Clontech) was used for the amplification with 5% DMSO by the following cycle with
steps 2 to 4 repeated 35 times: 94°C for 1 minute; 94°C for 15 seconds; 56°C for 20
seconds; 72°C for 1 minute 30 seconds; and 72°C for 5 minutes.

A 1.2kb PCR fragment was isolated from 1% agarose gel and cloned into the
pCRII-TOPO vector (Invitrogen) and completely sequenced using the ABI Big Dye
Terminator Kit (P.E. Biosystem). See, SEQ.ID.NO.:35 for nucleic acid sequence and
SEQ.ID.NO.:36 for deduced amino acid sequence.

s. hRUP26 (Seq. Id. Nos. 37 & 38) 
The disclosed human RUP26 was identified based upon the use of GenBank
database information. While searching the database, a cDNA clone with Accession
Number AC023040 was identified as a human genomic sequence from chromosome
2. The full length RUP26 was cloned by RT-PCR using RUP26 specific primers:
5'-AGCCATCCCTGCCAGGAAGCATGG-3' (SEQ.ID.NO.:77; sense, containing initiation
codon)
5'-CCAGACTGTGGACTCAAGAACTCTAGG-3' (SEQ.ID.NO.:78; antisense, containing stop
codon)
and human pancreas Marathon-Ready cDNA (Clontech) as template. Advantage cDNA
polymerase mix (Clontech) was used for the amplification in a 100µl reaction with 5%
DMSO by the following cycle with steps 2 to 4 repeated 35 times: 94°C for 5 minutes;
95°C for 30 seconds; 65°C for 30 seconds; 72°C for 2 minutes; and 72°C for 5 minutes.

A 1.1 kb PCR fragment was isolated from 1% agarose gel and cloned into the
pCRII-TOPO vector (Invitrogen) and completely sequenced using the ABI Big Dye
Terminator Kit (P.E. Biosystem). See, SEQ.ID.NO.:37 for nucleic acid sequence and
SEQ.ID.NO.:38 for deduced amino acid sequence.

t. hRUP27 (Seq. Id. Nos. 39 & 40)
The disclosed human RUP27 was identified based upon the use of GenBank
database information. While searching the database, a cDNA clone with Accession
Number AC027643 was identified as a human genomic sequence from chromosome
12. The full length RUP27 was cloned by PCR using RUP27 specific primers:
5'-AGTCCACGAACAATGAATCCATTTCATG-3' (SEQ.ID.NO.:79; sense, containing
initiation codon),
5'-ATCATGTCTAGACTCATGGTGATCC-3' (SEQ.ID.NO.:80; antisense, 3' of stop codon)
and the human adult brain Marathon-Ready cDNA (Clontech) as template. Advantage cDNA
polymerase mix (Clontech) was used for the amplification in a 50µl reaction with 5%
DMSO by the following cycle with steps 2 to 4 repeated 35 times: 94°C for 1 minute;
94°C for 10 seconds; 58°C for 20 seconds; 72°C for 1 minute 30 seconds; and 72°C for 5
minutes.

A 1.1 kb PCR fragment was isolated from 1% agarose gel and cloned into the
pCRII-TOPO vector (Invitrogen) and completely sequenced using the ABI Big Dye
Terminator Kit (P.E. Biosystem). See, SEQ.ID.NO.:39 for nucleic acid sequence and
SEQ.ID.NO.:40 for deduced amino acid sequence. The sequence of RUP27 cDNA
clone isolated from human brain was determined to match with five unordered segments
of AC027643, indicating that the RUP27 cDNA is composed of 5 exons.

Example 2

Preparation of Non-Endogenous, Constitutively Activated GPCRs

Those skilled in the art are credited with the ability to select techniques for
mutation of a nucleic acid sequence. Presented below are approaches utilized to
create non-endogenous versions of several of the human GPCRs disclosed above.
The mutations disclosed below are based upon an algorithmic approach whereby the
16th amino acid (located in the IC3 region of the GPCR) from a conserved proline (or
an endogenous, conservative substitution therefor) residue (located in the TM6
region of the GPCR, near the TM6/IC3 interface) is mutated, preferably to an alanine,
histidine, arginine or lysine amino acid residue, most preferably to a lysine amino acid
residue.
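
The positional rule described above can be illustrated with a short sketch. Several things here are assumptions rather than statements of the disclosure: positions are taken as 1-based, the position of the conserved TM6 proline is supplied by the user, and the 16 residues are counted toward the N-terminus because the IC3 loop precedes TM6 in the primary sequence. The function name and the toy sequence are hypothetical, and the strict proline check would need relaxing for receptors carrying a conservative substitution at that position.

```python
# Illustrative sketch only. Assumptions (not stated in the text): 1-based
# positions, a user-supplied TM6 proline position, and counting toward the
# N-terminus (IC3 lies before TM6 in the primary sequence).

def proline_offset_mutation(sequence, proline_pos, new_residue="K", offset=16):
    """Return a label such as 'V274K' for the residue `offset` positions
    N-terminal to the conserved TM6 proline."""
    if sequence[proline_pos - 1] != "P":
        raise ValueError("expected proline at position %d" % proline_pos)
    target_pos = proline_pos - offset
    if target_pos < 1:
        raise ValueError("offset runs past the N-terminus")
    original = sequence[target_pos - 1]
    return "%s%d%s" % (original, target_pos, new_residue)


# Toy 30-residue sequence (hypothetical, not a real receptor):
# valine at position 5, proline at position 21.
toy = "MAAAV" + "A" * 15 + "P" + "A" * 9
print(proline_offset_mutation(toy, proline_pos=21))  # V5K
```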

1. Transformer Site-Directed™ Mutagenesis

Preparation of non-endogenous human GPCRs may be accomplished on human
GPCRs using Transformer Site-Directed™ Mutagenesis Kit (Clontech) according to the
manufacturer's instructions. Two mutagenesis primers are utilized, most preferably a
lysine mutagenesis oligonucleotide that creates the lysine mutation, and a selection
marker oligonucleotide. For convenience, the codon mutation to be incorporated into the
human GPCR is also noted, in standard form (Table D):

TABLE D

Receptor Identifier    Codon Mutation

hRUP8                  V274K
hRUP9                  T249K
hRUP10                 R232K
hRUP11                 M294K
hRUP12                 F220K
hRUP16                 A238K
hRUP17                 Y215K
hRUP18                 L294K
hRUP19                 T219K
hRUP20                 K248A
                       K248H
                       K248R
hRUP21                 R240K
hRUP22                 Y222K
hRUP24                 A245K
hRUP25                 I230K
hRUP26                 V285K
hRUP27                 T248K
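
The "standard form" used in Table D names the original residue, its position, and the replacement residue (for example, V274K replaces valine 274 with lysine). A minimal sketch of reading and applying such a label follows; the helper names are hypothetical and the short test sequence is invented for illustration.

```python
import re

# Illustrative sketch only: helper names are hypothetical.
MUTATION_RE = re.compile(r"^([A-Z])(\d+)([A-Z])$")


def parse_mutation(label):
    """Split a label such as 'V274K' into (original, position, replacement)."""
    match = MUTATION_RE.match(label)
    if match is None:
        raise ValueError("not a point-mutation label: %r" % label)
    original, position, replacement = match.groups()
    return original, int(position), replacement


def apply_mutation(sequence, label):
    """Return `sequence` with the labelled substitution applied (1-based)."""
    original, position, replacement = parse_mutation(label)
    if sequence[position - 1] != original:
        raise ValueError("sequence does not have %s at %d" % (original, position))
    return sequence[:position - 1] + replacement + sequence[position:]


print(parse_mutation("V274K"))        # ('V', 274, 'K')
print(apply_mutation("MKV", "V3K"))   # MKK
```

Checking the original residue before substituting guards against applying a label to the wrong receptor or to a sequence numbered from a different start.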



2. QuikChange™ Site-Directed™ Mutagenesis

Preparation of non-endogenous human GPCRs can also be accomplished by
using QuikChange™ Site-Directed™ Mutagenesis Kit (Stratagene), according to the
manufacturer's instructions. Endogenous GPCR is preferably used as a template and
two mutagenesis primers are utilized, as well as, most preferably, a lysine mutagenesis
oligonucleotide and a selection marker oligonucleotide (included in kit). For
convenience, the codon mutation incorporated into the novel human GPCR and the
respective oligonucleotides are noted, in standard form (Table E):



TABLE E

Receptor    Codon     5'-3' orientation (sense),           5'-3' orientation (antisense)        Cycle Conditions
Identifier  Mutation  (SEQ.ID.NO.), mutation underlined    (SEQ.ID.NO.)                         Min ('), Sec (")
                                                                                                Cycles 2-4 repeated 16 times

hRUP13      A268K     GGGGAGGGAAAGCAAAGGTGGTCCTCCTGG (81)  CCAGGAGAACCACCTTTGCTTTCCCTCCCC (82)  98°C for 2'
                                                                                                98°C for 30"
                                                                                                56°C for 30"
                                                                                                72°C for 11' 40"
                                                                                                72°C for 5'

hRUP14      L246K     CAGGAAGGCAAAGACCACCATCATCATC (85)    GATGATGATGGTGGTCTTTGCCTTCCTG (86)    98°C for 2'
                                                                                                98°C for 30"
                                                                                                55°C for 30"
                                                                                                72°C for 11' 40"
                                                                                                72°C for 5'

hRUP15      A398K     CCAGTGCAAAGCTAAGAAAGTGATCTTC (89)    GAAGATCACTTTCTTAGCTTTGCACTGG (90)    98°C for 2'
                                                                                                98°C for 30"
                                                                                                55°C for 30"
                                                                                                72°C for 11' 40"
                                                                                                72°C for 5'

hRUP23      W275K     GCCGCCACCGCGCCAAGAGGAAGATTGGC (93)   GCCAATCTTCCTCTTGGCGCGGTGGCGGC (94)   98°C for 2'
                                                                                                98°C for 30"
                                                                                                56°C for 30"
                                                                                                72°C for 11' 40"
                                                                                                72°C for 5'
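
Each sense and antisense oligonucleotide pair in Table E should be the reverse complement of the other, since both primers anneal to the same site on opposite strands. As a minimal sketch (the helper name is hypothetical), the hRUP14 pair listed in Table E can be checked this way:

```python
# Illustrative sketch only: the helper name is hypothetical.
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq):
    """Return the reverse complement of an upper-case DNA sequence."""
    return seq.translate(COMPLEMENT)[::-1]


# hRUP14 (L246K) mutagenesis primers as listed in Table E.
sense = "CAGGAAGGCAAAGACCACCATCATCATC"      # SEQ.ID.NO.:85
antisense = "GATGATGATGGTGGTCTTTGCCTTCCTG"  # SEQ.ID.NO.:86

print(reverse_complement(sense) == antisense)  # True
```

The same check is a quick way to catch transcription errors in any primer pair before ordering oligonucleotides.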



The non-endogenous human GPCRs were then sequenced and the derived and
verified nucleic acid and amino acid sequences are listed in the accompanying
"Sequence Listing" appendix to this patent document, as summarized in Table F
below:



TABLE F

Non-Endogenous Human GPCR    Nucleic Acid Sequence Listing    Amino Acid Sequence Listing

hRUP13                       SEQ.ID.NO.:83                    SEQ.ID.NO.:84
hRUP14                       SEQ.ID.NO.:87                    SEQ.ID.NO.:88
hRUP15                       SEQ.ID.NO.:91                    SEQ.ID.NO.:92
hRUP23                       SEQ.ID.NO.:95                    SEQ.ID.NO.:96



Example 3
Receptor Expression

Although a variety of cells are available to the art for the expression of
proteins, it is most preferred that mammalian cells be utilized. The primary reason for
this is predicated upon practicalities, i.e., utilization of, e.g., yeast cells for the
expression of a GPCR, while possible, introduces into the protocol a non-mammalian
cell which may not (indeed, in the case of yeast, does not) include the
receptor-coupling, genetic-mechanism and secretory pathways that have evolved for
mammalian systems; thus, results obtained in non-mammalian cells, while of
potential use, are not as preferred as those obtained from mammalian cells. Of the
mammalian cells, COS-7, 293 and 293T cells are particularly preferred, although the
specific mammalian cell utilized can be predicated upon the particular needs of the
artisan.

a. Transient Transfection

On day one, 6x10^6 293 cells per 10 cm dish were plated out. On day two,
two reaction tubes were prepared (the proportions to follow for each tube are per plate):
tube A was prepared by mixing 4µg DNA (e.g., pCMV vector; pCMV vector with
receptor cDNA, etc.) in 0.5 ml serum free DMEM (Gibco BRL); tube B was prepared by
mixing 24µl lipofectamine (Gibco BRL) in 0.5ml serum free DMEM. Tubes A and B
were admixed by inversions (several times), followed by incubation at room temperature
for 30-45min. The admixture is referred to as the "transfection mixture". Plated 293
cells were washed with 1X PBS, followed by addition of 5 ml serum free DMEM. 1 ml
of the transfection mixture was added to the cells, followed by incubation for 4hrs at
37°C/5% CO2. The transfection mixture was removed by aspiration, followed by the
addition of 10ml of DMEM/10% Fetal Bovine Serum. Cells were incubated at 37°C/5%
CO2. After 48hr incubation, cells were harvested and utilized for analysis.

b. Stable Cell Lines: Gs Fusion Protein

Approximately 12x10^6 293 cells are plated on a 15cm tissue culture plate and
grown in DME High Glucose Medium containing ten percent fetal bovine serum and
one percent sodium pyruvate, L-glutamine, and antibiotics. Twenty-four hours
following plating of 293 cells to ~80% confluency, the cells are transfected using 12µg
of DNA. The 12µg of DNA is combined with 60µl of lipofectamine and 2mL of DME
High Glucose Medium without serum. The medium is aspirated from the plates and the
cells are washed once with medium without serum. The DNA, lipofectamine, and
medium mixture is added to the plate along with 10mL of medium without serum.
Following incubation at 37 degrees Celsius for four to five hours, the medium is
aspirated and 25ml of medium containing serum is added. Twenty-four hours following
transfection, the medium is aspirated again, and fresh medium with serum is added.
Forty-eight hours following transfection, the medium is aspirated and medium with
serum is added containing geneticin (G418 drug) at a final concentration of 500µg/mL.
The transfected cells now undergo selection for positively transfected cells containing
the G418 resistant gene. The medium is replaced every four to five days as selection
occurs. During selection, cells are grown to create stable pools, or split for stable clonal
selection.

Example 4

Assays for Determination of Constitutive Activity
of Non-Endogenous GPCRs

A variety of approaches are available for assessment of constitutive activity of
the non-endogenous human GPCRs. The following are illustrative; those of ordinary
skill in the art are credited with the ability to determine those techniques that are
preferentially beneficial for the needs of the artisan.

1. Membrane Binding Assays: [35S]GTPγS Assay

When a G protein-coupled receptor is in its active state, either as a result of
ligand binding or constitutive activation, the receptor couples to a G protein and
stimulates the release of GDP and subsequent binding of GTP to the G protein. The
alpha subunit of the G protein-receptor complex acts as a GTPase and slowly hydrolyzes
the GTP to GDP, at which point the receptor normally is deactivated. Constitutively
activated receptors continue to exchange GDP for GTP. The non-hydrolyzable GTP
analog, [35S]GTPγS, can be utilized to demonstrate enhanced binding of [35S]GTPγS to
membranes expressing constitutively activated receptors. The advantage of using
[35S]GTPγS binding to measure constitutive activation is that: (a) it is generically
applicable to all G protein-coupled receptors; and (b) it is proximal at the membrane
surface, making it less likely to pick up molecules which affect the intracellular cascade.

The assay utilizes the ability of G protein coupled receptors to stimulate
[35S]GTPγS binding to membranes expressing the relevant receptors. The assay can,
therefore, be used in the direct identification method to screen candidate compounds to
known, orphan and constitutively activated G protein-coupled receptors. The assay is
generic and has application to drug discovery at all G protein-coupled receptors.

The [35S]GTPγS assay was incubated in 20 mM HEPES and between 1 and
about 20mM MgCl2 (this amount can be adjusted for optimization of results, although
20mM is preferred) pH 7.4, binding buffer with between about 0.3 and about 1.2 nM
[35S]GTPγS (this amount can be adjusted for optimization of results, although 1.2 is
preferred) and 12.5 to 75 ng membrane protein (e.g. 293 cells expressing the Gs Fusion
Protein; this amount can be adjusted for optimization) and 10 µM GDP (this amount can
be changed for optimization) for 1 hour. Wheatgerm agglutinin beads (25 µl;
Amersham) were then added and the mixture incubated for another 30 minutes at room
temperature. The tubes were then centrifuged at 1500 x g for 5 minutes at room
temperature and then counted in a scintillation counter.

2. Adenylyl Cyclase

A Flash Plate™ Adenylyl Cyclase kit (New England Nuclear; Cat. No.
SMP004A) designed for cell-based assays can be modified for use with crude plasma
membranes. The Flash Plate wells can contain a scintillant coating which also contains a
specific antibody recognizing cAMP. The cAMP generated in the wells can be
quantitated by a direct competition for binding of radioactive cAMP tracer to the cAMP
antibody. The following serves as a brief protocol for the measurement of changes in
cAMP levels in whole cells that express the receptors.

Transfected cells were harvested approximately twenty four hours after transient
transfection. Media was carefully aspirated off and discarded. 10ml of PBS was gently
added to each dish of cells followed by careful aspiration. 1ml of Sigma cell
dissociation buffer and 3ml of PBS were added to each plate. Cells were pipetted off the
plate and the cell suspension was collected into a 50ml conical centrifuge tube. Cells
were then centrifuged at room temperature at 1,100 rpm for 5 min. The cell pellet was
carefully re-suspended into an appropriate volume of PBS (about 3ml/plate). The cells
were then counted using a hemocytometer and additional PBS was added to give the
appropriate number of cells (with a final volume of about 50 µl/well).
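
The final dilution step is a simple conservation-of-cells calculation. The sketch below shows one way to work out how much PBS to add after counting; the function name is hypothetical, and the example cell density and the target of 50,000 cells per well are invented for illustration, since the text does not state the number of cells per well.

```python
# Illustrative sketch only: the function name and the example numbers are
# hypothetical; the text specifies only the ~50 ul/well final volume.

def pbs_to_add_ml(cells_per_ml, suspension_ml, cells_per_well, well_volume_ul=50.0):
    """Volume of PBS (ml) to add so that each `well_volume_ul` aliquot
    contains `cells_per_well` cells."""
    target_cells_per_ml = cells_per_well / (well_volume_ul / 1000.0)
    if target_cells_per_ml > cells_per_ml:
        raise ValueError("suspension is already too dilute; re-pellet the cells")
    final_ml = cells_per_ml * suspension_ml / target_cells_per_ml
    return final_ml - suspension_ml


# Hypothetical count: 2e6 cells/ml in 3 ml, aiming for 5e4 cells per 50 ul.
print(pbs_to_add_ml(2e6, 3.0, 5e4))  # about 3 ml of additional PBS
```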

cAMP standards and Detection Buffer (comprising 1 µCi of tracer [125I] cAMP
(50 µl) to 11 ml Detection Buffer) were prepared and maintained in accordance with the
manufacturer's instructions. Assay Buffer was prepared fresh for screening and
contained 50µl of Stimulation Buffer, 3µl of test compound (12µM final assay
concentration) and 50µl cells. Assay Buffer was stored on ice until utilized. The assay
was initiated by addition of 50µl of cAMP standards to appropriate wells followed by
addition of 50µl of PBSA to wells H-11 and H12. 50µl of Stimulation Buffer was added
to all wells. DMSO (or selected candidate compounds) was added to appropriate wells
using a pin tool capable of dispensing 3µl of compound solution, with a final assay
concentration of 12µM test compound and 100µl total assay volume. The cells were
then added to the wells and incubated for 60 min at room temperature. 100µl of
Detection Mix containing tracer cAMP was then added to the wells. Plates were then
incubated for an additional 2 hours followed by counting in a Wallac MicroBeta
scintillation counter. Values of cAMP/well were then extrapolated from a standard cAMP
curve which was contained within each assay plate.
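
Reading cAMP values off the per-plate standard curve can be sketched as follows. This is not the kit manufacturer's procedure: it uses simple piecewise log-linear interpolation between neighbouring standards as a stand-in for whatever curve fit is actually applied, and the function name and the standard values are hypothetical. Because the assay is a competition format, counts fall as cAMP rises.

```python
import math

# Illustrative sketch only: piecewise log-linear interpolation is an assumed
# stand-in for the real curve fit; names and standard values are hypothetical.

def camp_from_counts(counts, standards):
    """Estimate cAMP per well from scintillation counts.

    standards: list of (pmol_cAMP_per_well, counts) pairs from the same plate.
    """
    # Sort by counts so that neighbouring standards bracket the sample.
    pts = sorted(standards, key=lambda p: p[1])
    for (amt_a, c_a), (amt_b, c_b) in zip(pts, pts[1:]):
        if c_a != c_b and c_a <= counts <= c_b:
            frac = (counts - c_a) / (c_b - c_a)
            log_amt = math.log10(amt_a) + frac * (math.log10(amt_b) - math.log10(amt_a))
            return 10 ** log_amt
    raise ValueError("counts fall outside the standard curve")


# Hypothetical standards: more cAMP displaces more tracer, so counts drop.
plate_standards = [(1.0, 9000.0), (10.0, 5000.0), (100.0, 1000.0)]
print(camp_from_counts(3000.0, plate_standards))  # between 10 and 100 pmol
```

Samples whose counts lie outside the range of the standards are rejected rather than extrapolated, which is the more cautious choice for a sketch of this kind.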

3. Cell-Based cAMP for Gi Coupled Target GPCRs

TSHR is a Gs coupled GPCR that causes the accumulation of cAMP upon
activation. TSHR will be constitutively activated by mutating amino acid residue 623
(i.e., changing an alanine residue to an isoleucine residue). A Gi coupled receptor is
expected to inhibit adenylyl cyclase, and, therefore, decrease the level of cAMP
production, which can make assessment of cAMP levels challenging. An effective
technique for measuring the decrease in production of cAMP as an indication of
constitutive activation of a Gi coupled receptor can be accomplished by co-transfecting,
most preferably, non-endogenous, constitutively activated TSHR (TSHR-A623I) (or an
endogenous, constitutively active Gs coupled receptor) as a "signal enhancer" with a Gi
linked target GPCR to establish a baseline level of cAMP. Upon creating a
non-endogenous version of the Gi coupled receptor, this non-endogenous version of the
target GPCR is then co-transfected with the signal enhancer, and it is this material that
can be used for screening. We will utilize such approach to effectively generate a signal
when a cAMP assay is used; this approach is preferably used in the direct identification
of candidate compounds against Gi coupled receptors. It is noted that for a Gi coupled
GPCR, when this approach is used, an inverse agonist of the target GPCR will increase
the cAMP signal and an agonist will decrease the cAMP signal.
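
The direction of the readout in this co-transfection format can be captured in a few lines. In this sketch the function name and the 20% change threshold are hypothetical; only the direction rule (increase suggests inverse agonist, decrease suggests agonist) comes from the text.

```python
# Illustrative sketch only: the function name and the 20% threshold are
# hypothetical; the direction rule is the one stated in the text for a
# Gi coupled target co-transfected with a Gs "signal enhancer".

def classify_gi_compound(baseline_camp, treated_camp, threshold=0.2):
    """Classify a candidate compound from the fractional change in cAMP."""
    change = (treated_camp - baseline_camp) / baseline_camp
    if change >= threshold:
        return "inverse agonist candidate"   # cAMP signal increased
    if change <= -threshold:
        return "agonist candidate"           # cAMP signal decreased
    return "no call"


print(classify_gi_compound(100.0, 150.0))  # inverse agonist candidate
print(classify_gi_compound(100.0, 60.0))   # agonist candidate
```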

On day one, 2x10^4 293 cells/well will be plated out. On day two, two
reaction tubes will be prepared (the proportions to follow for each tube are per plate):
tube A will be prepared by mixing 2µg DNA of each receptor transfected into the
mammalian cells, for a total of 4µg DNA (e.g., pCMV vector; pCMV vector with
mutated TSHR (TSHR-A623I); TSHR-A623I and GPCR, etc.) in 1.2ml serum free
DMEM (Irvine Scientific, Irvine, CA); tube B will be prepared by mixing 120µl
lipofectamine (Gibco BRL) in 1.2ml serum free DMEM. Tubes A and B will then be
admixed by inversions (several times), followed by incubation at room temperature for
30-45min. The admixture is referred to as the "transfection mixture". Plated 293 cells
will be washed with 1X PBS, followed by addition of 10ml serum free DMEM. 2.4ml of
the transfection mixture will then be added to the cells, followed by incubation for 4hrs
at 37°C/5% CO2. The transfection mixture will then be removed by aspiration, followed
by the addition of 25ml of DMEM/10% Fetal Bovine Serum. Cells will then be
incubated at 37°C/5% CO2. After 24hr incubation, cells will then be harvested and
utilized for analysis.

A Flash Plate™ Adenylyl Cyclase kit (New England Nuclear; Cat. No.
SMP004A) is designed for cell-based assays; however, it can be modified for use with
crude plasma membranes depending on the need of the skilled artisan. The Flash Plate
wells will contain a scintillant coating which also contains a specific antibody
recognizing cAMP. The cAMP generated in the wells can be quantitated by a direct
competition for binding of radioactive cAMP tracer to the cAMP antibody. The
following serves as a brief protocol for the measurement of changes in cAMP levels in
whole cells that express the receptors.

Transfected cells will be harvested approximately twenty four hours after
transient transfection. Media will be carefully aspirated off and discarded. 10ml of PBS
will be gently added to each dish of cells followed by careful aspiration. 1ml of Sigma
cell dissociation buffer and 3ml of PBS will be added to each plate. Cells will be pipetted
off the plate and the cell suspension will be collected into a 50ml conical centrifuge tube.
Cells will then be centrifuged at room temperature at 1,100 rpm for 5 min. The cell
pellet will be carefully re-suspended into an appropriate volume of PBS (about
3ml/plate). The cells will then be counted using a hemocytometer and additional PBS
will be added to give the appropriate number of cells (with a final volume of about
50µl/well).

cAMP standards and Detection Buffer (comprising 1 µCi of tracer [125I] cAMP
(50 µl) to 11 ml Detection Buffer) will be prepared and maintained in accordance with
the manufacturer's instructions. Assay Buffer should be prepared fresh for screening
and contain 50µl of Stimulation Buffer, 3µl of test compound (12µM final assay
concentration) and 50µl cells. Assay Buffer can be stored on ice until utilized. The assay
can be initiated by addition of 50µl of cAMP standards to appropriate wells followed by
addition of 50µl of PBSA to wells H-11 and H12. 50µl of Stimulation Buffer will be
added to all wells. Selected compounds (e.g., TSH) will be added to appropriate wells
using a pin tool capable of dispensing 3µl of compound solution, with a final assay
concentration of 12µM test compound and 100µl total assay volume. The cells will then
be added to the wells and incubated for 60 min at room temperature. 100µl of Detection
Mix containing tracer cAMP will then be added to the wells. Plates will then be incubated
for an additional 2 hours followed by counting in a Wallac MicroBeta scintillation counter.
Values of cAMP/well will then be extrapolated from a standard cAMP curve which is
contained within each assay plate.

4. Reporter-Based Assays

a. Cre-Luc Reporter Assay (Gs-associated receptors)

293 and 293T cells are plated out on 96 well plates at a density of 2 x 10^4 cells
per well and were transfected using Lipofectamine Reagent (BRL) the following day
according to the manufacturer's instructions. A DNA/lipid mixture is prepared for each
6-well transfection as follows: 260ng of plasmid DNA in 100µl of DMEM were gently
mixed with 2µl of lipid in 100µl of DMEM (the 260ng of plasmid DNA consisted of
200ng of a 8xCRE-Luc reporter plasmid, 50ng of pCMV comprising endogenous
receptor or non-endogenous receptor or pCMV alone, and 10ng of a GPRS expression
plasmid (GPRS in pcDNA3 (Invitrogen))). The 8XCRE-Luc reporter plasmid was
prepared as follows: vector SRIF-β-gal was obtained by cloning the rat somatostatin
promoter (-71/+51) at BglV-HindIII site in the pβgal-Basic Vector (Clontech). Eight
(8) copies of cAMP response element were obtained by PCR from an adenovirus
template AdpCF126CCRE8 (see, 7 Human Gene Therapy 1883 (1996)) and cloned
into the SRIF-β-gal vector at the Kpn-BglV site, resulting in the 8xCRE-β-gal
reporter vector. The 8xCRE-Luc reporter plasmid was generated by replacing the
beta-galactosidase gene in the 8xCRE-β-gal reporter vector with the luciferase gene
obtained from the pGL3-basic vector (Promega) at the HindIII-BamHI site.
Following 30 min. incubation at room temperature, the DNA/lipid mixture was
diluted with 400 µl of DMEM and 100µl of the diluted mixture was added to each
well. 100 µl of DMEM with 10% FCS were added to each well after a 4hr incubation
in a cell culture incubator. The following day the medium on the transfected cells was
changed to 200 µl/well of DMEM with 10% FCS. Eight (8) hours later, the wells were
changed to 100 µl/well of DMEM without phenol red, after one wash with PBS. Luciferase
activity was measured the next day using the LucLite™ reporter gene assay kit
(Packard) following the manufacturer's instructions and read on a 1450 MicroBeta™
scintillation and luminescence counter (Wallac).
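
The quantities in the DNA/lipid mixture are internally consistent, which a short calculation makes visible: the three plasmids sum to 260 ng, and the 600 µl of diluted mixture dispensed at 100 µl per well serves six wells. The sketch below uses hypothetical variable names and ignores the small volumes contributed by the lipid and the DNA stocks.

```python
# Illustrative sketch only: variable names are hypothetical, and the small
# volumes of the lipid (2 ul) and of the DNA stocks are ignored.

plasmids_ng = {
    "8xCRE-Luc reporter": 200,
    "pCMV receptor (or pCMV alone)": 50,
    "GPRS expression plasmid": 10,
}

total_ng = sum(plasmids_ng.values())           # total plasmid DNA in the mix
mix_ul = 100 + 100 + 400                        # DNA in DMEM + lipid in DMEM + diluent
wells = mix_ul // 100                           # 100 ul of diluted mixture per well
per_well_ng = {name: ng / wells for name, ng in plasmids_ng.items()}

print(total_ng, wells)                          # 260 6
for name, ng in per_well_ng.items():
    print("%s: %.1f ng/well" % (name, ng))
```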

b. AP1 reporter assay (Gq-associated receptors)

A method to detect Gq stimulation depends on the known property of
Gq-dependent phospholipase C to cause the activation of genes containing AP1 elements
in their promoter. A Pathdetect™ AP-1 cis-Reporting System (Stratagene, Catalogue
# 219073) can be utilized following the protocol set forth above with respect to the
CREB reporter assay, except that the components of the calcium phosphate precipitate
were 410 ng pAP1-Luc, 80 ng pCMV-receptor expression plasmid, and 20 ng
CMV-SEAP.

c. SRF-Luc Reporter Assay (Gq-associated receptors)

One method to detect Gq stimulation depends on the known property of
Gq-dependent phospholipase C to cause the activation of genes containing serum
response factors in their promoter. A Pathdetect™ SRF-Luc-Reporting System
(Stratagene) can be utilized to assay for Gq coupled activity in, e.g., COS7 cells.
Cells are transfected with the plasmid components of the system and the indicated
expression plasmid encoding endogenous or non-endogenous GPCR using a
Mammalian Transfection™ Kit (Stratagene, Catalogue #200285) according to the
manufacturer's instructions. Briefly, 410 ng SRF-Luc, 80 ng pCMV-receptor
expression plasmid and 20 ng CMV-SEAP (secreted alkaline phosphatase expression
plasmid; alkaline phosphatase activity is measured in the media of transfected cells to
control for variations in transfection efficiency between samples) are combined in a
calcium phosphate precipitate as per the manufacturer's instructions. Half of the
precipitate is equally distributed over 3 wells in a 96-well plate and kept on the cells in a
serum free media for 24 hours. For the last 5 hours the cells are incubated with 1 µM
Angiotensin, where indicated. Cells are then lysed and assayed for luciferase activity
using a Luclite™ Kit (Packard, Cat. # 6016911) and "Trilux 1450 Microbeta" liquid
scintillation and luminescence counter (Wallac) as per the manufacturer's
instructions. The data can be analyzed using GraphPad Prism™ 2.0a (GraphPad
Software Inc.).

d. Intracellular IP3 Accumulation Assay (Gq-associated receptors)

On day 1, cells comprising the receptors (endogenous and/or non-endogenous)
can be plated onto 24 well plates, usually 1x10^5 cells/well (although this number can be
optimized). On day 2 cells can be transfected by firstly mixing 0.25 µg DNA in 50 µl
serum free DMEM/well and 2 µl lipofectamine in 50 µl serum free DMEM/well. The
solutions are gently mixed and incubated for 15-30 min at room temperature. Cells are
washed with 0.5 ml PBS and 400 µl of serum free media is mixed with the transfection
media and added to the cells. The cells are then incubated for 3-4 hrs at 37°C/5% CO2
and then the transfection media is removed and replaced with 1 ml/well of regular growth
media. On day 3 the cells are labeled with 3H-myo-inositol. Briefly, the media is
removed and the cells are washed with 0.5 ml PBS. Then 0.5 ml inositol-free/serum free
media (GIBCO BRL) is added/well with 0.25 µCi of 3H-myo-inositol/well and the cells
are incubated for 16-18 hrs o/n at 37°C/5% CO2. On Day 4 the cells are washed with 0.5
ml PBS and 0.45 ml of assay medium is added containing inositol-free/serum free media,
10 µM pargyline, 10 mM lithium chloride, or 0.4 ml of assay medium and 50µl of 10x
ketanserin (ket) to final concentration of 10µM. The cells are then incubated for 30 min
at 37°C. The cells are then washed with 0.5 ml PBS and 200µl of fresh/ice cold stop
solution (1M KOH; 18 mM Na-borate; 3.8 mM EDTA) is added/well. The solution is
kept on ice for 5-10 min or until cells are lysed and then neutralized by 200 µl of
fresh/ice cold neutralization sol. (7.5% HCl). The lysate is then transferred into 1.5 ml
eppendorf tubes and 1 ml of chloroform/methanol (1:2) is added/tube. The solution is
vortexed for 15 sec and the upper phase is applied to a Biorad AG1-X8™ anion
exchange resin (100-200 mesh). Firstly, the resin is washed with water at 1:1.25 W/V
and 0.9 ml of upper phase is loaded onto the column. The column is washed with 10 mls
of 5 mM myo-inositol and 10 ml of 5 mM Na-borate/60mM Na-formate. The inositol
tris phosphates are eluted into scintillation vials containing 10 ml of scintillation cocktail
with 2 ml of 0.1 M formic acid/1 M ammonium formate. The columns are regenerated
by washing with 10 ml of 0.1 M formic acid/3M ammonium formate and rinsed twice
with dd H2O and stored at 4°C in water.

Exemplary results are presented below in Table G: 



TABLE G

Receptor | Mutation | Assay Utilized (Figure No.) | Signal Generated: CMV | Signal Generated: Endogenous Version (Relative Light Units) | Signal Generated: Non-Endogenous Version (Relative Light Units) | Difference Between: 1. CMV v. Wild-type; 2. Wild-type v. Mutant
hRUP12 | N/A | IP3 (Figure 1) | 317.03 cpm/mg protein | 3463.29 cpm/mg protein | | 1. 11 Fold ↑
hRUP13 | N/A | cAMP (Figure 2) | 8.06 pmol/cAMP/mg protein | 19.10 pmol/cAMP/mg protein | | 1. 2.4 Fold ↑
hRUP13 | A268K | 8XCRE-LUC (Figure 3) | 3665.43 LCPS | 83280.17 LCPS | 61713.6 LCPS | 1. 23 Fold ↑; 2. 26% ↓
hRUP14 | L246K | 8XCRE-LUC (Figure 5) | 86.07 LCPS | 1962.87 LCPS | 789.73 LCPS | 1. 23 Fold ↑; 2. 60% ↓
hRUP15 | A398K | 8XCRE-LUC (Figure 6) | 86.07 LCPS | 18286.77 LCPS | 17034.83 LCPS | 1. 212 Fold ↑; 2. 7% ↓
hRUP15 | A398K | cAMP (Figure 7) | 15.00 pmol/cAMP/mg protein | 164.4 pmol/cAMP/mg protein | 117.5 pmol/cAMP/mg protein | 1. 11 Fold ↑; 2. 29% ↓
hRUP17 | N/A | IP3 (Figure 9) | 317.03 cpm/mg protein | 741.07 cpm/mg protein | | 1. 2.3 Fold ↑
hRUP21 | N/A | IP3 (Figure 10) | 730.5 cpm/mg protein | 1421.9 cpm/mg protein | | 1. 2 Fold ↑
hRUP23 | W275K | 8XCRE-LUC (Figure 11) | 311.73 pmol/cAMP/mg protein | 13756.00 pmol/cAMP/mg protein | 9756.87 pmol/cAMP/mg protein | 1. 44 Fold ↑; 2. 30% ↓
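The "Difference" column of Table G can be reproduced from the signal columns. The following is a minimal sketch, assuming the fold value is the ratio of the endogenous-version signal to the CMV control and the percentage is the drop from the endogenous version to the non-endogenous version; the function names are illustrative and do not come from the specification.

```python
def fold_increase(control: float, signal: float) -> float:
    """Ratio of a receptor signal to its control signal."""
    return signal / control


def percent_decrease(reference: float, signal: float) -> float:
    """Percent by which `signal` falls below `reference`."""
    return (reference - signal) / reference * 100.0


# hRUP14, 8XCRE-LUC assay; values in LCPS copied from Table G.
cmv, wild_type, mutant = 86.07, 1962.87, 789.73

print(round(fold_increase(cmv, wild_type)))        # 23 (CMV v. wild-type)
print(round(percent_decrease(wild_type, mutant)))  # 60 (wild-type v. mutant)
```

Applying the same two functions to the other rows gives values that agree with the tabulated differences after rounding.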

Exemplary results of the GTPγS assay for detecting constitutive activation, as
disclosed in Example 4(1) above, were obtained utilizing Gs:Fusion Protein
Constructs on human RUP13 and RUP15. Table H below lists the signals generated
from this assay and the difference in signals as indicated:



TABLE H

Receptor: Gs Fusion Protein | Assay Utilized (Figure No.) | Signal Generated: CMV (cpm bound GTP) | Signal Generated: Fusion Protein (cpm bound GTP) | Signal Generated: CMV + 10 µM GDP (cpm bound GTP) | Signal Generated: Fusion Protein + 10 µM GDP (cpm bound GTP) | Difference Between (cpm bound GTP): 1. CMV v. Fusion Protein; 2. CMV+GDP vs. Fusion+GDP; 3. Fusion vs. Fusion+GDP
hRUP13-Gs | GTPγS (Figure 4) | 32494.0 | 49351.30 | 11148.30 | 28834.67 | 1. 1.5 Fold ↑; 2. 2.6 Fold ↑; 3. 42% ↓
hRUP15-Gs | GTPγS (Figure 8) | 30131.67 | 32493.67 | 7697.00 | 14157.33 | 1. 1.1 Fold ↑; 2. 1.8 Fold ↑; 3. 56% ↓



Example 5

Fusion Protein Preparation
a. GPCR:Gs Fusion Construct

The design of the constitutively activated GPCR-G protein fusion construct was
accomplished as follows: both the 5' and 3' ends of the rat G protein Gsα (long form;
Itoh, H. et al., 83 PNAS 3776 (1986)) were engineered to include a HindIII
(5'-AAGCTT-3') sequence thereon. Following confirmation of the correct sequence
(including the flanking HindIII sequences), the entire sequence was shuttled into
pcDNA3.1(-) (Invitrogen, cat. no. V795-20) by subcloning using the HindIII restriction
site of that vector. The correct orientation for the Gsα sequence was determined after
subcloning into pcDNA3.1(-). The modified pcDNA3.1(-) containing the rat Gsα gene
at the HindIII sequence was then verified; this vector was now available as a "universal"
Gsα protein vector. The pcDNA3.1(-) vector contains a variety of well-known
restriction sites upstream of the HindIII site, thus beneficially providing the ability to
insert, upstream of the Gs protein, the coding sequence of an endogenous, constitutively
active GPCR. This same approach can be utilized to create other "universal" G protein
vectors, and, of course, other commercially available or proprietary vectors known to the
artisan can be utilized; the important criterion is that the sequence for the GPCR be
upstream and in-frame with that of the G protein.
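The in-frame requirement stated above can be illustrated with a short check. This is a minimal sketch under stated assumptions: the upstream GPCR coding sequence is supplied without its own stop codon, any restriction-site spacer is passed as `linker`, and the sequences used in the usage lines are invented toy strings, not sequences from this document.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}


def is_in_frame_fusion(upstream_orf: str, linker: str = "") -> bool:
    """Return True if a downstream coding sequence placed directly after
    `upstream_orf` + `linker` would be read in the same frame.

    Conditions checked: the upstream sequence starts with ATG, the combined
    length is a multiple of three, and no in-frame stop codon occurs.
    """
    seq = (upstream_orf + linker).upper()
    if not seq.startswith("ATG") or len(seq) % 3 != 0:
        return False
    codons = (seq[i:i + 3] for i in range(0, len(seq), 3))
    return not any(codon in STOP_CODONS for codon in codons)


# Toy (hypothetical) sequences for illustration only.
print(is_in_frame_fusion("ATGGCCAAA", linker="GATATC"))  # True
print(is_in_frame_fusion("ATGGCCTAA"))                   # False: in-frame stop
print(is_in_frame_fusion("ATGGCCAA"))                    # False: frame shifted
```

A check of this kind only covers the reading frame; it says nothing about orientation, which the text above establishes separately after subcloning.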

RUP13 couples via Gs. For the following exemplary GPCR Fusion Proteins,
fusion to Gsα was accomplished.

A RUP13-Gsα Fusion Protein construct was made as follows: primers were
designed as follows:

5'-gatc[TCTAGAAT]GGAGTCCTCACCCATCCCCCAG-3' (SEQ.ID.NO.:97; sense)
5'-gatc[GATATC]CGTGACTCCAGCCGGGGTGAGGCGGC-3' (SEQ.ID.NO.:98; antisense).

Nucleotides in lower caps are included as spacers in the restriction sites
(designated in brackets) between the G protein and RUP13. The sense and anti-sense
primers included the restriction sites for XbaI and EcoRV, respectively, such that spacers
(attributed to the restriction sites) exist between the G protein and RUP13.

PCR was then utilized to secure the respective receptor sequences for fusion
within the Gsα universal vector disclosed above, using the following protocol for each:
100 ng cDNA for RUP13 was added to separate tubes containing 2 µl of each primer
(sense and anti-sense), 3 µL of 10 mM dNTPs, 10 µL of 10X TaqPlus™ Precision buffer,
1 µL of TaqPlus™ Precision polymerase (Stratagene: #600211), and 80 µL of water.
Reaction temperatures and cycle times for RUP13 were as follows, with cycle steps 2
through 4 repeated 35 times: 94°C for 1 min; 94°C for 30 seconds; 62°C for 20
sec; 72°C for 1 min 40 sec; and 72°C for 5 min. The PCR product was run on a 1% agarose
gel and then purified (data not shown). The purified product was digested with XbaI and
EcoRV and the desired inserts purified and ligated into the Gs universal vector at the
respective restriction site. The positive clones were isolated following transformation and
determined by restriction enzyme digest; expression using 293 cells was accomplished
following the protocol set forth infra. Each positive clone for RUP13-Gs Fusion Protein
was sequenced to verify correctness. (See, SEQ.ID.NO.:99 for nucleic acid sequence
and SEQ.ID.NO.:100 for amino acid sequence).

RUP15 couples via Gs. For the following exemplary GPCR Fusion Proteins,
fusion to Gsα was accomplished.

A RUP15-Gsα Fusion Protein construct was made as follows: primers were
designed as follows:

5'-TCTAGAATGACGTCCACCTGCACCAACAGC-3' (SEQ.ID.NO.:101; sense)
5'-gatatcGCAGGAAAAGTAGCAGAATCGTAGGAAG-3' (SEQ.ID.NO.:102; antisense).

Nucleotides in lower caps are included as spacers in the restriction sites between
the G protein and RUP15. The sense and anti-sense primers included the restriction sites
for XbaI and EcoRV, respectively, such that spacers (attributed to the restriction sites)
exist between the G protein and RUP15.
PCR was then utilized to secure the respective receptor sequences for fusion
within the Gsα universal vector disclosed above, using the following protocol for each:
100 ng cDNA for RUP15 was added to separate tubes containing 2 µl of each primer
(sense and anti-sense), 3 µL of 10 mM dNTPs, 10 µL of 10X TaqPlus™ Precision buffer,
1 µL of TaqPlus™ Precision polymerase (Stratagene: #600211), and 80 µL of water.
Reaction temperatures and cycle times for RUP15 were as follows, with cycle steps 2
through 4 repeated 35 times: 94°C for 1 min; 94°C for 30 seconds; 62°C for 20
sec; 72°C for 1 min 40 sec; and 72°C for 5 min. The PCR product was run on a 1% agarose
gel and then purified (data not shown). The purified product was digested with EcoRV
and XbaI and the desired inserts purified and
ligated into the Gs universal vector at the respective restriction site. The positive clones
were isolated following transformation and determined by restriction enzyme digest;
expression using 293 cells was accomplished following the protocol set forth infra.
Each positive clone for RUP15-Gs Fusion Protein was sequenced to verify correctness.
(See, SEQ.ID.NO.:103 for nucleic acid sequence and SEQ.ID.NO.:104 for amino acid
sequence).
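The restriction sites carried by the two RUP15 primers can be confirmed by a plain string search. The sketch below assumes the primer sequences as printed for SEQ.ID.NO.:101 and SEQ.ID.NO.:102; the `SITES` dictionary and the function name are illustrative. Because each of these recognition sequences is palindromic, searching one strand is sufficient.

```python
# Recognition sequences of the enzymes named in this Example.
SITES = {"XbaI": "TCTAGA", "EcoRV": "GATATC", "HindIII": "AAGCTT"}


def sites_in(primer: str) -> list[str]:
    """Names of the enzymes whose recognition sequence occurs in `primer`."""
    seq = primer.upper()  # spacer nucleotides are printed in lower case
    return [name for name, site in SITES.items() if site in seq]


sense = "TCTAGAATGACGTCCACCTGCACCAACAGC"         # SEQ.ID.NO.:101
antisense = "gatatcGCAGGAAAAGTAGCAGAATCGTAGGAAG"  # SEQ.ID.NO.:102

print(sites_in(sense))      # ['XbaI']
print(sites_in(antisense))  # ['EcoRV']
```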

b. Gq(6 amino acid deletion)/Gi Fusion Construct

The design of a Gq(del)/Gi fusion construct can be accomplished as follows:
the N-terminal six (6) amino acids (amino acids 2 through 7, having the sequence of
TLESIM (SEQ.ID.NO.:129)) of the Gαq-subunit will be deleted and the C-terminal five (5)
amino acids, having the sequence EYNLV (SEQ.ID.NO.:130), will be replaced with the
corresponding amino acids of the Gαi Protein, having the sequence DCGLF
(SEQ.ID.NO.:131). This fusion construct will be obtained by PCR using the following
primers:

5'-gatcaagcttCATGGCGTGCTGCCTGAGCGAGGAG-3' (SEQ.ID.NO.:132) and

5'-gatcggatccTTAGAACAGGCCGCAGTCCTTCAGGTTCAGCTGCAGGATGGTG-3'
(SEQ.ID.NO.:133)

and Plasmid 63313, which contains the mouse Gαq-wild type version with a
hemagglutinin tag, as template. Nucleotides in lower caps are included as spacers.

TaqPlus Precision DNA polymerase (Stratagene) will be utilized for the
amplification by the following cycles, with steps 2 through 4 repeated 35 times: 95°C
for 2 min; 95°C for 20 sec; 56°C for 20 sec; 72°C for 2 min; and 72°C for 7 min. The
PCR product will be cloned into a pCRII-TOPO vector (Invitrogen) and sequenced
using the ABI Big Dye Terminator kit (P.E. Biosystem). Inserts from a TOPO clone
containing the sequence of the fusion construct will be shuttled into the expression
vector pcDNA3.1(+) at the HindIII/BamHI site by a 2 step cloning process.
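The design above can be cross-checked in two ways: the upper-case portion of the antisense primer (SEQ.ID.NO.:133) should encode the Gαi tail DCGLF followed by a stop codon, and the protein-level edit is a deletion of residues 2 through 7 plus a swap of the last five residues. The following is a minimal sketch using the standard genetic code; the helper names are illustrative, and the protein passed to `gq_del_gi` in the usage line is a made-up placeholder, not the real Gαq sequence.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}


def reverse_complement(seq: str) -> str:
    """Reverse complement of a DNA sequence (A/C/G/T only)."""
    return seq.upper().translate(str.maketrans("ACGT", "TGCA"))[::-1]


def translate(seq: str) -> str:
    """Translate complete codons of `seq`, reading from its first base."""
    usable = len(seq) - len(seq) % 3
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, usable, 3))


def gq_del_gi(gq_protein: str, deleted: str = "TLESIM",
              gq_tail: str = "EYNLV", gi_tail: str = "DCGLF") -> str:
    """Delete residues 2-7 and replace the C-terminal five residues."""
    if gq_protein[1:7] != deleted or not gq_protein.endswith(gq_tail):
        raise ValueError("input does not carry the expected Gq termini")
    return gq_protein[0] + gq_protein[7:-len(gq_tail)] + gi_tail


# Upper-case (annealing) portion of the antisense primer, SEQ.ID.NO.:133.
antisense = "TTAGAACAGGCCGCAGTCCTTCAGGTTCAGCTGCAGGATGGTG"
coding_strand = reverse_complement(antisense)
print(translate(coding_strand[-18:]))  # DCGLF*

# Placeholder protein: M + TLESIM + arbitrary core + EYNLV.
print(gq_del_gi("MTLESIMAAAAEYNLV"))   # MAAAADCGLF
```

The last six codons on the coding strand read GAC TGC GGC CTG TTC TAA, which is consistent with the DCGLF replacement described in the text.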

Example 6

Tissue Distribution of the disclosed human GPCRs: RT-PCR

RT-PCR was applied to confirm the expression and to determine the tissue
distribution of several novel human GPCRs. The oligonucleotides utilized were
GPCR-specific, and human multiple tissue cDNA panels (MTC, Clontech) were used as
templates. Taq DNA polymerase (Stratagene) was utilized for the amplification in a 40 µl
reaction according to the manufacturer's instructions. 20 µl of the reaction was
loaded on a 1.5% agarose gel to analyze the RT-PCR products. Table J below lists the
receptors, the cycle conditions and the primers utilized.



TABLE J

Receptor Identifier | Cycle Conditions: Min ('), Sec ("); cycles 2-4 repeated 30 times unless noted | 5' Primer (SEQ.ID.NO.) | 3' Primer (SEQ.ID.NO.) | DNA Fragment | Tissue Expression
hRUP10 | 94°C for 30"; 94°C for 10"; 62°C for 20"; 72°C for 1'; 72°C for 7' (*cycles 2-4 repeated 35 times) | CATGTATGCCAGCGTCCTGCTCC (105) | GCTATGCCTGAAGCCAGTCTTGTG (106) | 730bp | Kidney, leukocyte, liver, placenta and spleen
hRUP11 | 94°C for 2'; 94°C for 15"; 67°C for 15"; 72°C for 45"; 72°C for 5' | GCACCTGCTCCTGAGCACCTTCTCC (107) | CACAGCGCTGCAGCCCTGCAGCTGGC (108) | 630bp | Liver, kidney, pancreas, colon, small intestinal, spleen and prostate
hRUP12 | 94°C for 2'; 94°C for 15"; 66°C for 15"; 72°C for 45"; 72°C for 5' | CCAGTGATGACTCTGTCCAGCCTG (109) | CAGACACTTGGCAGGGACGAGGTG (110) | 490bp | Brain, colon, heart, kidney, leukocyte, pancreas, prostate, small intestinal, spleen, testis, and thymus
hRUP13 | 94°C for 1'; 94°C for 15"; 68°C for 20"; 72°C for 1'45"; 72°C for 5' | CTTGTGGTCTACTGCAGCATGTTCCG (111) | CATATCCCTCCGAGTGTCCAGCGGC (112) | 700bp | Placenta and lung
hRUP14 | 94°C for 1'; 94°C for 15"; 68°C for 20"; 72°C for 1'45"; 72°C for 5' | ATGGATCCTTATCATGGCTTCCTC (113) | CAAGAACAGGTCTCATCTAAGAGCTCC (114) | 700bp | Not yet determined
hRUP16 | 94°C for 30"; 94°C for 5"; 69°C for 15"; 72°C for 30"; 72°C for 5' | CTCTGATGCCATCTGCTGGATTCCTG (115) | GTAGTCCACTGAAAGTCCAGTGATCC (116) | 370bp | Fetal brain, fetal kidney and fetal skeletal muscle
hRUP18 | 94°C for 2'; 94°C for 15"; 60°C for 20"; 72°C for 1'; 72°C for 5' | TGGTGGCGATGGCCAACAGCGCTC (117) | GTTGCGCCTTAGCGACAGATGACC (118) | 330bp | Pancreas
hRUP21 | 94°C for 1'; 94°C for 15"; 56°C for 20"; 72°C for 40" (*cycles 2-3 repeated 30 times) | TCAACCTGTATAGCAGCATCCTC (119) | AAGGAGTAGCAGAATGGTTAGCC (120) | | Kidney, lung and testis
hRUP22 | 94°C for 30"; 94°C for 15"; 69°C for 20"; 72°C for 40" (*cycles 2-3 repeated 30 times) | GACACCTGTCAGCGGTCGTGTGTG (121) | CTGATGGAAGTAGAGGCTGTCCATCTC (122) | | Testis, thymus and spleen
hRUP23 | 94°C for 2'; 94°C for 15"; 60°C for 20"; 72°C for 1'; 72°C for 5' | GCGCTGAGCGCAGACCAGTGGCTG (123) | CACGGTGACGAAGGGCACGAGCTC (124) | 520bp | Placenta
hRUP26 | 94°C for 2'; 94°C for 15"; 65°C for 20"; 72°C for 1'; 72°C for 5' | AGCCATCCCTGCCAGGAAGCATGG (125) | CCAGGTAGGTGTGCAGCACAATGGC (126) | 470bp | Pancreas
hRUP27 | 94°C for 30"; 94°C for 10"; 55°C for 20"; 72°C for 1'; 72°C for 3' (*cycles 2-4 repeated 35 times) | CTGTTCAACAGGGCTGGTTGGCAAC (127) | ATCATGTCTAGACTCATGGTGATCC (128) | 890bp | Brain
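The cycle conditions in Table J can be turned into a programmed run time with simple arithmetic. The following is a minimal sketch, assuming the hRUP11 row reads 94°C for 2 min, then 30 repeats of 94°C for 15 sec, 67°C for 15 sec and 72°C for 45 sec, then 72°C for 5 min; the function name is illustrative, and ramp times between temperatures are not included.

```python
def programmed_seconds(initial: int, cycle_steps: list[int],
                       repeats: int, final: int = 0) -> int:
    """Total hold time of a PCR program, in seconds."""
    return initial + repeats * sum(cycle_steps) + final


# hRUP11: 2' initial hold; 15", 15" and 45" steps repeated 30 times; 5' final.
hrup11 = programmed_seconds(initial=2 * 60, cycle_steps=[15, 15, 45],
                            repeats=30, final=5 * 60)
print(hrup11)       # 2670
print(hrup11 / 60)  # 44.5 minutes
```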

Example 7

Protocol: Direct Identification of Inverse Agonists and Agonists
A. [35S]GTPγS Assay

Although we have utilized endogenous, constitutively active GPCRs for the
direct identification of candidate compounds as, e.g., inverse agonists, for reasons that
are not altogether understood, intra-assay variation can become exacerbated.
Preferably, then, a GPCR Fusion Protein, as disclosed above, is also utilized with a
non-endogenous, constitutively activated GPCR. We have determined that when such a
protein is used, intra-assay variation appears to be substantially stabilized, whereby an
effective signal-to-noise ratio is obtained. This has the beneficial result of allowing for a
more robust identification of candidate compounds. Thus, it is preferred that for direct
identification, a GPCR Fusion Protein be used and that when utilized, the following
assay protocols be utilized.

1. Membrane Preparation

Membranes comprising the constitutively active orphan GPCR Fusion Protein of
interest and for use in the direct identification of candidate compounds as inverse
agonists, agonists or partial agonists are preferably prepared as follows:

a. Materials

"Membrane Scrape Buffer" is comprised of 20 mM HEPES and 10 mM EDTA,
pH 7.4; "Membrane Wash Buffer" is comprised of 20 mM HEPES and 0.1 mM
EDTA, pH 7.4; "Binding Buffer" is comprised of 20 mM HEPES, 100 mM NaCl, and
10 mM MgCl2, pH 7.4.

b. Procedure

All materials will be kept on ice throughout the procedure. Firstly, the media
will be aspirated from a confluent monolayer of cells, followed by rinse with 10 ml cold
PBS, followed by aspiration. Thereafter, 5 ml of Membrane Scrape Buffer will be added
to scrape cells; this will be followed by transfer of cellular extract into 50 ml centrifuge
tubes (centrifuged at 20,000 rpm for 17 minutes at 4°C). Thereafter, the supernatant will
be aspirated and the pellet will be resuspended in 30 ml Membrane Wash Buffer
followed by centrifuge at 20,000 rpm for 17 minutes at 4°C. The supernatant will then
be aspirated and the pellet resuspended in Binding Buffer. This will then be
homogenized using a Brinkman polytron™ homogenizer (15-20 second bursts until
all material is in suspension). This is referred to herein as "Membrane Protein".

2. Bradford Protein Assay

Following the homogenization, protein concentration of the membranes will
be determined using the Bradford Protein Assay (protein can be diluted to about
1.5 mg/ml, aliquoted and frozen (-80°C) for later use; when frozen, protocol for use
will be as follows: on the day of the assay, frozen Membrane Protein is thawed at
room temperature, followed by vortex and then homogenized with a polytron at about
12 x 1,000 rpm for about 5-10 seconds; it was noted that for multiple preparations, the
homogenizer should be thoroughly cleaned between homogenization of different
preparations).

a. Materials

Binding Buffer (as per above); Bradford Dye Reagent; Bradford Protein
Standard will be utilized, following manufacturer instructions (Biorad, cat. no.
500-0006).

b. Procedure

Duplicate tubes will be prepared, one including the membrane, and one as a
control "blank". Each will contain 800 µl Binding Buffer. Thereafter, 10 µl of Bradford
Protein Standard (1 mg/ml) will be added to each tube, and 10 µl of Membrane Protein
will then be added to just one tube (not the blank). Thereafter, 200 µl of Bradford Dye
Reagent will be added to each tube, followed by vortex of each. After five (5)
minutes, the tubes will be re-vortexed and the material therein will be transferred to
cuvettes. The cuvettes will then be read using a CECIL 3041 spectrophotometer, at
wavelength 595 nm.

3. Direct Identification Assay

a. Materials

GDP Buffer consisted of 37.5 ml Binding Buffer and 2 mg GDP (Sigma, cat. no.
G-7127), followed by a series of dilutions in Binding Buffer to obtain 0.2 µM GDP
(final concentration of GDP in each well was 0.1 µM GDP); each well comprising a
candidate compound has a final volume of 200 µl consisting of 100 µl GDP Buffer (final
concentration, 0.1 µM GDP), 50 µl Membrane Protein in Binding Buffer, and 50 µl
[35S]GTPγS (0.6 nM) in Binding Buffer (2.5 µl [35S]GTPγS per 10 ml Binding Buffer).
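The well composition stated above can be checked with a short dilution calculation. This is a minimal sketch: the component volumes and the 0.2 µM GDP Buffer concentration are taken from the text, while the dictionary layout and function name are illustrative assumptions.

```python
import math


def final_concentration(stock: float, added_volume: float,
                        total_volume: float) -> float:
    """Concentration after diluting `added_volume` of stock into `total_volume`."""
    return stock * added_volume / total_volume


# Per-well volumes in microlitres, as listed in the Materials paragraph.
well_ul = {"GDP Buffer": 100, "Membrane Protein": 50, "[35S]GTPgammaS": 50}
total_ul = sum(well_ul.values())

gdp_final_um = final_concentration(0.2, well_ul["GDP Buffer"], total_ul)
print(total_ul)                         # 200
print(math.isclose(gdp_final_um, 0.1))  # True: 0.1 µM GDP in the well
```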

b. Procedure

Candidate compounds will be preferably screened using a 96-well plate format
(these can be frozen at -80°C). Membrane Protein (or membranes with expression
vector excluding the GPCR Fusion Protein, as control), will be homogenized briefly
until in suspension. Protein concentration will then be determined using the Bradford
Protein Assay set forth above. Membrane Protein (and control) will then be diluted to
0.25 mg/ml in Binding Buffer (final assay concentration, 12.5 µg/well). Thereafter, 100
µl GDP Buffer will be added to each well of a Wallac Scintistrip™ (Wallac). A 5 µl
pin-tool will then be used to transfer 5 µl of a candidate compound into such well (i.e., 5 µl in
total assay volume of 200 µl is a 1:40 ratio such that the final screening concentration of
the candidate compound is 10 µM). Again, to avoid contamination, after each transfer
step the pin tool should be rinsed in three reservoirs comprising water (1X), ethanol (1X)
and water (2X); excess liquid should be shaken from the tool after each rinse and dried
with paper and kimwipes. Thereafter, 50 µl of Membrane Protein will be added to each
well (a control well comprising membranes without the GPCR Fusion Protein will also be
utilized), and pre-incubated for 5-10 minutes at room temperature. Thereafter, 50 µl of
[35S]GTPγS (0.6 nM) in Binding Buffer will be added to each well, followed by
incubation on a shaker for 60 minutes at room temperature (again, in this example, plates
were covered with foil). The assay will then be stopped by spinning of the plates at 4000
RPM for 15 minutes at 22°C. The plates will then be aspirated with an 8 channel
manifold and sealed with plate covers. The plates will then be read on a Wallac 1450
using setting "Prot. #37" (as per manufacturer instructions).
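Two quantities in the procedure above follow directly from the stated volumes: the compound stock concentration implied by a 5 µl transfer into 200 µl at 10 µM final, and the mass of membrane protein delivered by 50 µl of a 0.25 mg/ml suspension. The sketch below is illustrative only; the function names are assumptions, and the 400 µM stock figure is derived here rather than stated in the text.

```python
def stock_needed(final_conc: float, transfer_volume: float,
                 total_volume: float) -> float:
    """Stock concentration that yields `final_conc` after the transfer."""
    return final_conc * total_volume / transfer_volume


def protein_per_well_ug(conc_mg_per_ml: float, volume_ul: float) -> float:
    """Protein mass in micrograms (1 mg/ml equals 1 µg/µl)."""
    return conc_mg_per_ml * volume_ul


# 5 µl pin-tool transfer into a 200 µl assay, 10 µM final (a 1:40 dilution).
print(stock_needed(10, 5, 200))        # 400.0 (µM, derived)

# 50 µl of Membrane Protein diluted to 0.25 mg/ml.
print(protein_per_well_ug(0.25, 50))   # 12.5 (µg per well)
```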

B. Cyclic AMP Assay

Another assay approach to directly identify candidate compounds was
accomplished by utilizing a cyclase-based assay. In addition to direct identification, this
assay approach can be utilized as an independent approach to provide confirmation of
the results from the [35S]GTPγS approach as set forth above.

A modified Flash Plate™ Adenylyl Cyclase kit (New England Nuclear; Cat. No.
SMP004A) was preferably utilized for direct identification of candidate compounds as
inverse agonists and agonists to constitutively activated orphan GPCRs in accordance
with the following protocol.

Transfected cells were harvested approximately three days after transfection.
Membranes were prepared by homogenization of suspended cells in buffer containing
20 mM HEPES, pH 7.4 and 10 mM MgCl2. Homogenization was performed on ice using
a Brinkman Polytron™ for approximately 10 seconds. The resulting homogenate was
centrifuged at 49,000 X g for 15 minutes at 4°C. The resulting pellet was then
resuspended in buffer containing 20 mM HEPES, pH 7.4 and 0.1 mM EDTA,
homogenized for 10 seconds, followed by centrifugation at 49,000 X g for 15 minutes at
4°C. The resulting pellet was then stored at -80°C until utilized. On the day of direct
identification screening, the membrane pellet was slowly thawed at room temperature,
resuspended in buffer containing 20 mM HEPES, pH 7.4 and 10 mM MgCl2, to yield a
final protein concentration of 0.60 mg/ml (the resuspended membranes are placed on ice
until use).

cAMP standards and Detection Buffer (comprising 2 µCi of tracer [125I] cAMP
(100 µl) to 11 ml Detection Buffer) were prepared and maintained in accordance with
the manufacturer's instructions. Assay Buffer was prepared fresh for screening and
contained 20 mM HEPES, pH 7.4, 10 mM MgCl2, 20 mM phosphocreatine (Sigma), 0.1
units/ml creatine phosphokinase (Sigma), 50 µM GTP (Sigma), and 0.2 mM ATP
(Sigma); Assay Buffer was then stored on ice until utilized.

Candidate compounds identified as per above (if frozen, thawed at room
temperature) were added, preferably, to 96-well plate wells (3 µl/well; 12 µM final assay
concentration), together with 40 µl Membrane Protein (30 µg/well) and 50 µl of Assay
Buffer. This admixture was then incubated for 30 minutes at room temperature, with
gentle shaking.

Following the incubation, 100 µl of Detection Buffer was added to each well,
followed by incubation for 2-24 hours. Plates were then counted in a Wallac
MicroBeta™ plate reader using "Prot. #31" (as per manufacturer instructions).

A representative screening assay plate (96 well format) result is presented in
Figure 12. Each bar represents the results for a different compound in each well, plus
RUP13-Gsα Fusion Protein construct, as prepared in Example 5(a) above. The
representative results presented in Figure 12 also provide standard deviations based upon
the mean results of each plate ("m") and the mean plus or minus two standard deviations.
An arbitrary preference for
selection of inverse agonists as "leads" from the primary screen involves selection of
candidate compounds that reduce the per cent response by at least the mean plate
response, minus two standard deviations. Conversely, an arbitrary preference for
selection of agonists as "leads" from the primary screen involves selection of
candidate compounds that increase the per cent response by at least the mean plate
response, plus two standard deviations. Based upon these selection processes, the
candidate compounds in the following wells were directly identified as putative inverse
agonist (Compound A) and agonist (Compound B) to RUP13 in wells A2 and G9,
respectively. See Figure 12. It is noted for clarity: these compounds have been directly
identified without any knowledge of the endogenous ligand for this GPCR. By focusing
on assay techniques that are based upon receptor function, and not compound binding
affinity, we are able to ascertain compounds that are able to reduce the functional
activity of this receptor (Compound A) as well as increase the functional activity of the
receptor (Compound B). Based upon the location of these receptors in lung tissue (see,
for example, hRUP13 and hRUP21 in Example 6), pharmaceutical agents can be
developed for potential therapeutic treatment of lung cancer.
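The selection rule described above (responses at or below the plate mean minus two standard deviations flag putative inverse agonists; responses at or above the mean plus two standard deviations flag putative agonists) can be sketched as follows. This is an illustrative sketch, not the procedure actually used: it assumes the sample standard deviation over all wells of the plate, and the plate values and well names below are invented for demonstration and are not data from Figure 12.

```python
from statistics import mean, stdev


def classify_wells(responses: dict[str, float], n_sd: float = 2.0):
    """Split wells into putative inverse agonists and agonists.

    A well is flagged when its response lies at least `n_sd` sample
    standard deviations below or above the plate mean.
    """
    values = list(responses.values())
    m, sd = mean(values), stdev(values)
    low, high = m - n_sd * sd, m + n_sd * sd
    inverse_agonists = [w for w, r in responses.items() if r <= low]
    agonists = [w for w, r in responses.items() if r >= high]
    return inverse_agonists, agonists


# Hypothetical plate: 18 unremarkable wells plus two outliers.
plate = {f"W{i}": (98.0 if i % 2 else 102.0) for i in range(18)}
plate["X1"] = 40.0    # made-up low responder
plate["X2"] = 160.0   # made-up high responder

print(classify_wells(plate))  # (['X1'], ['X2'])
```

Whether the outlier wells themselves should contribute to the mean and standard deviation is a design choice the text does not settle; including them, as here, makes the thresholds more conservative.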

References cited throughout this patent document, including co-pending and
related patent applications, unless otherwise indicated, are fully incorporated herein by
reference. Modifications and extension of the disclosed inventions that are within the
purview of the skilled artisan are encompassed within the above disclosure and the
claims that follow.

Although a variety of expression vectors are available to those in the art, for
purposes of utilization for both the endogenous and non-endogenous human GPCRs, it is
most preferred that the vector utilized be pCMV. This vector was deposited with the
American Type Culture Collection (ATCC) on October 13, 1998 (10801 University
Blvd., Manassas, VA 20110-2209 USA) under the provisions of the Budapest Treaty for
the International Recognition of the Deposit of Microorganisms for the Purpose of Patent
Procedure. The DNA was tested by the ATCC and determined to be viable. The ATCC
has assigned the following deposit number to pCMV: ATCC #203351.

CLAIMS

What is claimed is:

1. A G protein-coupled receptor encoded by an amino acid sequence of
SEQ.ID.NO.:2.

2. A non-endogenous, constitutively activated version of the G protein-coupled
receptor of claim 1.

3. A plasmid comprising a vector and the cDNA of SEQ.ID.NO.:1.

4. A host cell comprising the plasmid of claim 3.

5. A G protein-coupled receptor encoded by an amino acid sequence of
SEQ.ID.NO.:4.

6. A non-endogenous, constitutively activated version of the G protein-coupled
receptor of claim 5.

7. A plasmid comprising a vector and the cDNA of SEQ.ID.NO.:3.

8. A host cell comprising the plasmid of claim 7.

9. A G protein-coupled receptor encoded by an amino acid sequence of
SEQ.ID.NO.:6.

10. A non-endogenous, constitutively activated version of the G protein-coupled
receptor of claim 9.

11. A plasmid comprising a vector and the cDNA of SEQ.ID.NO.:5.

12. A host cell comprising the plasmid of claim 11.

13. A G protein-coupled receptor encoded by an amino acid sequence of
SEQ.ID.NO.:8.

14. A non-endogenous, constitutively activated version of the G protein-coupled
receptor of claim 13.

15. A plasmid comprising a vector and the cDNA of SEQ.ID.NO.:7.

16. A host cell comprising the plasmid of claim 15.

17. A G protein-coupled receptor encoded by an amino acid sequence of
SEQ.ID.NO.:10.

18. A non-endogenous, constitutively activated version of the G protein-coupled
receptor of claim 17.

19. A plasmid comprising a vector and the cDNA of SEQ.ID.NO.:9.

20. A host cell comprising the plasmid of claim 19.

21. A G protein-coupled receptor encoded by an amino acid sequence of
SEQ.ID.NO.:12.

22. A non-endogenous, constitutively activated version of the G protein-coupled
receptor of claim 21 comprising an amino acid sequence of SEQ.ID.NO.:84.

23. A plasmid comprising a vector and the cDNA of SEQ.ID.NO.:11.

24. A host cell comprising the plasmid of claim 23.

25. A G protein-coupled receptor encoded by an amino acid sequence of
SEQ.ID.NO.:14.

26. A non-endogenous, constitutively activated version of the G protein-coupled
receptor of claim 25 comprising an amino acid sequence of SEQ.ID.NO.:88.

27. A plasmid comprising a vector and the cDNA of SEQ.ID.NO.:13.

28. A host cell comprising the plasmid of claim 27.

29. A G protein-coupled receptor encoded by an amino acid sequence of
SEQ.ID.NO.:16.

30. A non-endogenous, constitutively activated version of the G protein-coupled
receptor of claim 29 comprising an amino acid sequence of SEQ.ID.NO.:92.

31. A plasmid comprising a vector and the cDNA of SEQ.ID.NO.:15.

32. A host cell comprising the plasmid of claim 31.

33. A G protein-coupled receptor encoded by an amino acid sequence of
SEQ.ID.NO.:18.

34. A non-endogenous, constitutively activated version of the G protein-coupled
receptor of claim 33.

35. A plasmid comprising a vector and the cDNA of SEQ.ID.NO.:17.

36. A host cell comprising the plasmid of claim 35.

37. A G protein-coupled receptor encoded by an amino acid sequence of
SEQ.ID.NO.:20.

38. A non-endogenous, constitutively activated version of the G protein-coupled
receptor of claim 37.

39. A plasmid comprising a vector and the cDNA of SEQ.ID.NO.:19.

40. A host cell comprising the plasmid of claim 39.

41. A G protein-coupled receptor encoded by an amino acid sequence of
SEQ.ID.NO.:22.

42. A non-endogenous, constitutively activated version of the G protein-coupled
receptor of claim 41.

43. A plasmid comprising a vector and the cDNA of SEQ.ID.NO.:21.

44. A host cell comprising the plasmid of claim 43.

45. A G protein-coupled receptor encoded by an amino acid sequence of
SEQ.ID.NO.:24.

46. A non-endogenous, constitutively activated version of the G protein-coupled
receptor of claim 45.

47. A plasmid comprising a vector and the cDNA of SEQ.ID.NO.:23.

48. A host cell comprising the plasmid of claim 47.




49. A G protein-coupled receptor encoded by an amino acid sequence of
SEQ.ID.NO.:26.

50. A non-endogenous, constitutively activated version of the G protein-coupled
receptor of claim 49.

51. A plasmid comprising a vector and the cDNA of SEQ.ID.NO.:25.

52. A host cell comprising the plasmid of claim 51.

53. A G protein-coupled receptor encoded by an amino acid sequence of
SEQ.ID.NO.:28.

54. A non-endogenous, constitutively activated version of the G protein-coupled
receptor of claim 53.

55. A plasmid comprising a vector and the cDNA of SEQ.ID.NO.:27.

56. A host cell comprising the plasmid of claim 55.

57. A G protein-coupled receptor encoded by an amino acid sequence of
SEQ.ID.NO.:30.

58. A non-endogenous, constitutively activated version of the G protein-coupled
receptor of claim 57.

59. A plasmid comprising a vector and the cDNA of SEQ.ID.NO.:29.

60. A host cell comprising the plasmid of claim 59.

61. A G protein-coupled receptor encoded by an amino acid sequence of
SEQ.ID.NO.:32.

62. A non-endogenous, constitutively activated version of the G protein-coupled
receptor of claim 61 comprising an amino acid sequence of SEQ.ID.NO.:96.

63. A plasmid comprising a vector and the cDNA of SEQ.ID.NO.:95.

64. A host cell comprising the plasmid of claim 63.

65. A G protein-coupled receptor encoded by an amino acid sequence of
SEQ.ID.NO.:34.

66. A non-endogenous, constitutively activated version of the G protein-coupled
receptor of claim 65.

67. A plasmid comprising a vector and the cDNA of SEQ.ID.NO.:33.

68. A host cell comprising the plasmid of claim 67.

69. A G protein-coupled receptor encoded by an amino acid sequence of
SEQ.ID.NO.:36.

70. A non-endogenous, constitutively activated version of the G protein-coupled
receptor of claim 69.

71. A plasmid comprising a vector and the cDNA of SEQ.ID.NO.:35.

72. A host cell comprising the plasmid of claim 71.

73. A G protein-coupled receptor encoded by an amino acid sequence of
SEQ.ID.NO.:38.

74. A non-endogenous, constitutively activated version of the G protein-coupled
receptor of claim 73.

75. A plasmid comprising a vector and the cDNA of SEQ.ID.NO.:37.

76. A host cell comprising the plasmid of claim 75.

77. A G protein-coupled receptor encoded by an amino acid sequence of
SEQ.ID.NO.:40.

78. A non-endogenous, constitutively activated version of the G protein-coupled
receptor of claim 77.

79. A plasmid comprising a vector and the cDNA of SEQ.ID.NO.:39.

80. A host cell comprising the plasmid of claim 79.

[Drawing sheets 1/12 through 12/12 (Figures 1 through 12): the graphical content is not
recoverable from the text extraction. Legible axis labels: sheet 2/12, pmol/cAMP/mg
protein; sheets 3/12 and 5/12, LCPS; sheets 4/12 and 8/12, cpm bound GTP; sheets 9/12
and 10/12, IP3 accumulation (cpm/mg protein).]

WO 01/36471    PCT/US00/31509

SEQUENCE LISTING 

<110> Arena Pharmaceuticals, Inc. 
Chen, Rupong 
Dang, Huong T. 
Lowitz, Kevin P. 

<120> Non-Endogenous, Constitutively Activated Human G Protein-Coupled Receptors 

<130> AREN0087 

<150> 60/166,088 

<151> 1999-11-17 

<150> 60/166,369 

<151> 1999-11-17 

<150> 60/166,099 

<151> 1999-11-17 

<150> 60/171,902

<151> 1999-12-23 

<150> 60/171,901 

<151> 1999-12-23 

<150> 60/171,900 

<151> 1999-12-23 

<150> 60/181,749

<151> 2000-02-11

<150> 60/186,258

<151> 2000-03-14

<150> 60/189,259

<151> 2000-03-14

<150> 60/195,899

<151> 2000-04-10

<150> 60/196,078

<151> 2000-04-10

<150> 60/195,898

<151> 2000-04-10

<150> 60/200,419

<151> 2000-04-28

<150> 60/203,630

<151> 2000-05-12

<150> 60/210,741

<151> 2000-06-12

<150> 60/210,982

<151> 2000-06-12

<150> 60/226,760

<151> 2000-08-21

<150> 60/235,779

<151> 2000-09-26 

<150> 60/235,418 
<151> 2000-09-26 

<150> 60/242,332 
<151> 2000-10-20 

<150> 60/242,343 
<151> 2000-10-20 

<150> 60/243,019 
<151> 2000-10-24 

<160> 133 

<170> PatentIn version 3.0

<210> 1 

<211> 1155 

<212> DNA 

<213> Homo sapiens 

<400> 1 

atggcagccc agaatggaaa caccagtttc acacccaact ttaatccacc ccaagaccat 60 

gcctcctccc tctcctttaa cttcagttat ggtgattatg acctccctat ggatgaggat 120 

gaggacatga ccaagacccg gaccttcttc gcagccaaga tcgtcattgg cattgcactg 180 

gcaggcatca tgctggtctg cggcatcggt aactttgtct ttatcgctgc cctcacccgc 240

tataagaagt tgcgcaacct caccaatctg ctcattgcca acctggccat ctccgacttc 300 

ctggtggcca tcatctgctg ccccttcgag atggactact acgtggtacg gcagctctcc 360 

tgggagcatg gccacgtgct ctgtgcctcc gtcaactacc tgcgcaccgt ctccctctac 420 

gtctccacca atgccttgct ggccattgcc attgacagat atctcgccat cgttcacccc 480 

ttgaaaccac ggatgaatta tcaaacggcc tccttcctga tcgccttggt ctggatggtg 540 

tccattctca ttgccatccc atcggcttac tttgcaacag aaacggtcct ctttattgtc 600 

aagagccagg agaagatctt ctgtggccag atctggcctg tggatcagca gctctactac 660 

aagtcctact tcctcttcat ctttggtgtc gagttcgtgg gccctgtggt caccatgacc 720 

ctgtgctatg ccaggatctc ccgggagctc tggttcaagg cagtccctgg gttccagacg 780 

gagcagattc gcaagcggct gcgctgccgc aggaagacgg tcctggtgct catgtgcatt 840 

ctcacggcct atgtgctgtg ctgggcaccc ttctacggtt tcaccatcgt tcgtgacttc 900 

ttccccactg tgttcgtgaa ggaaaagcac tacctcactg ccttctacgt ggtcgagtgc 960 

atcgccatga gcaacagcat gatcaacacc gtgtgcttcg tgacggtcaa gaacaacacc 1020 

atgaagtact tcaagaagat gatgctgctg cactggcgtc cctcccagcg ggggagcaag 1080 

tccagtgctg accttgacct cagaaccaac ggggtgccca ccacagaaga ggtggactgt 1140 

atcaggctga agtga 1155 



<210> 2
<211> 384
<212> PRT

<213> Homo sapiens 

<400> 2 

Met Ala Ala Gln Asn Gly Asn Thr Ser Phe Thr Pro Asn Phe Asn Pro
1 5 10 15

Pro Gln Asp His Ala Ser Ser Leu Ser Phe Asn Phe Ser Tyr Gly Asp
20 25 30

Tyr Asp Leu Pro Met Asp Glu Asp Glu Asp Met Thr Lys Thr Arg Thr
35 40 45

Phe Phe Ala Ala Lys Ile Val Ile Gly Ile Ala Leu Ala Gly Ile Met
50 55 60

Leu Val Cys Gly Ile Gly Asn Phe Val Phe Ile Ala Ala Leu Thr Arg
65 70 75 80

Tyr Lys Lys Leu Arg Asn Leu Thr Asn Leu Leu Ile Ala Asn Leu Ala
85 90 95

Ile Ser Asp Phe Leu Val Ala Ile Ile Cys Cys Pro Phe Glu Met Asp
100 105 110

Tyr Tyr Val Val Arg Gln Leu Ser Trp Glu His Gly His Val Leu Cys
115 120 125

Ala Ser Val Asn Tyr Leu Arg Thr Val Ser Leu Tyr Val Ser Thr Asn
130 135 140

Ala Leu Leu Ala Ile Ala Ile Asp Arg Tyr Leu Ala Ile Val His Pro
145 150 155 160

Leu Lys Pro Arg Met Asn Tyr Gln Thr Ala Ser Phe Leu Ile Ala Leu
165 170 175

Val Trp Met Val Ser Ile Leu Ile Ala Ile Pro Ser Ala Tyr Phe Ala
180 185 190

Thr Glu Thr Val Leu Phe Ile Val Lys Ser Gln Glu Lys Ile Phe Cys
195 200 205

Gly Gln Ile Trp Pro Val Asp Gln Gln Leu Tyr Tyr Lys Ser Tyr Phe
210 215 220

Leu Phe Ile Phe Gly Val Glu Phe Val Gly Pro Val Val Thr Met Thr
225 230 235 240

Leu Cys Tyr Ala Arg Ile Ser Arg Glu Leu Trp Phe Lys Ala Val Pro
245 250 255

Gly Phe Gln Thr Glu Gln Ile Arg Lys Arg Leu Arg Cys Arg Arg Lys
260 265 270

Thr Val Leu Val Leu Met Cys Ile Leu Thr Ala Tyr Val Leu Cys Trp
275 280 285

Ala Pro Phe Tyr Gly Phe Thr Ile Val Arg Asp Phe Phe Pro Thr Val
290 295 300

Phe Val Lys Glu Lys His Tyr Leu Thr Ala Phe Tyr Val Val Glu Cys
305 310 315 320

Ile Ala Met Ser Asn Ser Met Ile Asn Thr Val Cys Phe Val Thr Val
325 330 335

Lys Asn Asn Thr Met Lys Tyr Phe Lys Lys Met Met Leu Leu His Trp
340 345 350

Arg Pro Ser Gln Arg Gly Ser Lys Ser Ser Ala Asp Leu Asp Leu Arg
355 360 365

Thr Asn Gly Val Pro Thr Thr Glu Glu Val Asp Cys Ile Arg Leu Lys
370 375 380

<210> 3 

<211> 1260 

<212> DNA 

<213> Homo sapiens 

<400> 3 

atgctggcag ctgcctttgc agactctaac tccagcagca tgaatgtgtc ctttgctcac 60 

ctccactttg ccggagggta cctgccctct gattcccagg actggagaac catcatcccg 120 

gctctcttgg tggctgtctg cctggtgggc ttcgtgggaa acctgtgtgt gattggcatc 180 

ctccttcaca atgcttggaa aggaaagcca tccatgatcc actccctgat tctgaatctc 240 

agcctggctg atctctccct cctgctgttt tctgcaccta tccgagctac ggcgtactcc 300 

aaaagtgttt gggatctagg ctggtttgtc tgcaagtcct ctgactggtt tatccacaca 360 

tgcatggcag ccaagagcct gacaatcgtt gtggtggcca aagtatgctt catgtatgca 420 

agtgacccag ccaagcaagt gagtatccac aactacacca tctggtcagt gctggtggcc 480

atctggactg tggctagcct gttacccctg ccggaatggt tctttagcac catcaggcat 540 

catgaaggtg tggaaatgtg cctcgtggat gtaccagctg tggctgaaga gtttatgtcg 600 

atgtttggta agctctaccc actcctggca tttggccttc cattattttt tgccagcttt 660 

tatttctgga gagcttatga ccaatgtaaa aaacgaggaa ctaagactca aaatcttaga 720 

aaccagatac gctcaaagca agtcacagtg atgctgctga gcattgccat catctctgct 780 

ctcttgtggc tccccgaatg ggtagcttgg ctgtgggtat ggcatctgaa ggctgcaggc 840 

ccggccccac cacaaggttt catagccctg tctcaagtct tgatgttttc catctcttca 900 

gcaaatcctc tcatttttct tgtgatgtcg gaagagttca gggaaggctt gaaaggtgta 960 

tggaaatgga tgataaccaa aaaacctcca actgtctcag agtctcagga aacaccagct 1020 

ggcaactcag agggtcttcc tgacaaggtt ccatctccag aatccccagc atccatacca 1080 

gaaaaagaga aacccagctc tccctcctct ggcaaaggga aaactgagaa ggcagagatt 1140 

cccatccttc ctgacgtaga gcagttttgg catgagaggg acacagtccc ttctgtacag 1200 

gacaatgacc ctatcccctg ggaacatgaa gatcaagaga caggggaagg tgttaaatag 1260 

<210> 4 

<211> 419 

<212> PRT 

<213> Homo sapiens

<400> 4



Met Leu Ala Ala Ala Phe Ala Asp Ser Asn Ser Ser Ser Met Asn Val
1 5 10 15

Ser Phe Ala His Leu His Phe Ala Gly Gly Tyr Leu Pro Ser Asp Ser
20 25 30

Gln Asp Trp Arg Thr Ile Ile Pro Ala Leu Leu Val Ala Val Cys Leu
35 40 45

Val Gly Phe Val Gly Asn Leu Cys Val Ile Gly Ile Leu Leu His Asn
50 55 60

Ala Trp Lys Gly Lys Pro Ser Met Ile His Ser Leu Ile Leu Asn Leu
65 70 75 80

Ser Leu Ala Asp Leu Ser Leu Leu Leu Phe Ser Ala Pro Ile Arg Ala
85 90 95

Thr Ala Tyr Ser Lys Ser Val Trp Asp Leu Gly Trp Phe Val Cys Lys
100 105 110

Ser Ser Asp Trp Phe Ile His Thr Cys Met Ala Ala Lys Ser Leu Thr
115 120 125

Ile Val Val Val Ala Lys Val Cys Phe Met Tyr Ala Ser Asp Pro Ala
130 135 140

Lys Gln Val Ser Ile His Asn Tyr Thr Ile Trp Ser Val Leu Val Ala
145 150 155 160

Ile Trp Thr Val Ala Ser Leu Leu Pro Leu Pro Glu Trp Phe Phe Ser
165 170 175

Thr Ile Arg His His Glu Gly Val Glu Met Cys Leu Val Asp Val Pro
180 185 190

Ala Val Ala Glu Glu Phe Met Ser Met Phe Gly Lys Leu Tyr Pro Leu
195 200 205

Leu Ala Phe Gly Leu Pro Leu Phe Phe Ala Ser Phe Tyr Phe Trp Arg
210 215 220

Ala Tyr Asp Gln Cys Lys Lys Arg Gly Thr Lys Thr Gln Asn Leu Arg
225 230 235 240

Asn Gln Ile Arg Ser Lys Gln Val Thr Val Met Leu Leu Ser Ile Ala
245 250 255

Ile Ile Ser Ala Leu Leu Trp Leu Pro Glu Trp Val Ala Trp Leu Trp
260 265 270

Val Trp His Leu Lys Ala Ala Gly Pro Ala Pro Pro Gln Gly Phe Ile
275 280 285

Ala Leu Ser Gln Val Leu Met Phe Ser Ile Ser Ser Ala Asn Pro Leu
290 295 300

Ile Phe Leu Val Met Ser Glu Glu Phe Arg Glu Gly Leu Lys Gly Val
305 310 315 320

Trp Lys Trp Met Ile Thr Lys Lys Pro Pro Thr Val Ser Glu Ser Gln
325 330 335

Glu Thr Pro Ala Gly Asn Ser Glu Gly Leu Pro Asp Lys Val Pro Ser
340 345 350

Pro Glu Ser Pro Ala Ser Ile Pro Glu Lys Glu Lys Pro Ser Ser Pro
355 360 365

Ser Ser Gly Lys Gly Lys Thr Glu Lys Ala Glu Ile Pro Ile Leu Pro
370 375 380

Asp Val Glu Gln Phe Trp His Glu Arg Asp Thr Val Pro Ser Val Gln
385 390 395 400

Asp Asn Asp Pro Ile Pro Trp Glu His Glu Asp Gln Glu Thr Gly Glu
405 410 415

Gly Val Lys 

<210> 5 

<211> 1014 

<212> DNA 

<213> Homo sapiens

<400> 5

atggggaacg at t etc;: ci-; ctacgagtat ggggattaca gcgacctctc ggaccgccct 60

gtggactgcc v-'t rjcrt ggcc atcgacccgc tgcgcgtggc cccgctccca 120

ctgtatgccg ccat c ct: q-:tqr;:ig9tg ccgggcaatg ccatggtggc ctgggtggct 180

gggaaggtgg [illegible] ggtgggtgcc acctggttgc tccacctggc cgtggcggat 240

ttgctgtgct gcttg'-ctc? gcccatcctg gcagtgccca ttgcccgtgg aggccactgg 300

ccgtatggtg cagtg-.^'^ct -1 t ; gggcgctg ccctccatca tcctgctgac catgtatgcc 360

agcgtcctgc [illegible] tctcagtgcc gacctctgct tcctggctct cgggcctgcc 420

tggtggtcta cggctcugcq ggcgtgcggg gtgcaggtgg cctgtggggc agcctggaca 480

ctggccttgc tgctcaccgt gccctccgcc atctaccgcc ggctgcacca ggagcacttc 540

ccagcccggc tgcaqt<:t t^t ggtggactac ggcggctcct ccagcaccga gaatgcggtg 600

actgccatcc ggtttctttt tggcttcctg gggcccctgg tggccgtggc cagctgccac 660

agtgccctcc tgtgctggac agcccgacgc tgccggccgc tgggcacagc cattgtggtg 720

gggttttttg tctgctgggc occctaccac ctgctggggc tggtgctcac tgtggcggcc 780

ccgaactccg cactcctggc cagggccctg cgggctgaac ccctcatcgt gggccttgcc 840

ctcgctcaca gctgcctcaa tcccatgctc ttcctgtatt ttgggagggc tcaactccgc 900

cggtcactgc cagctgcctg tcactgggcc ctgagggagt cccagggcca ggacgaaagt 960

gtggacagca agaaatccac cagccatgac ctggtctcgg agatggaggt gtag 1014



<210> 6 

<211> 337 

<212> PRT 

<213> Homo sapiens

<400> 6

Met Gly Asn Asp Ser Val Ser Tyr Glu Tyr Gly Asp Tyr Ser Asp Leu
1 5 10 15

Ser Asp Arg Pro Val Asp Cys Leu Asp Gly Ala Cys Leu Ala Ile Asp
20 25 30

Pro Leu Arg Val Ala Pro Leu Pro Leu Tyr Ala Ala Ile Phe Leu Val
35 40 45

Gly Val Pro Gly Asn Ala Met Val Ala Trp Val Ala Gly Lys Val Ala
50 55 60

Arg Arg Arg Val Gly Ala Thr Trp Leu Leu His Leu Ala Val Ala Asp
65 70 75 80

Leu Leu Cys Cys Leu Ser Leu Pro Ile Leu Ala Val Pro Ile Ala Arg
85 90 95

Gly Gly His Trp Pro Tyr Gly Ala Val Gly Cys Arg Ala Leu Pro Ser
100 105 110

Ile Ile Leu Leu Thr Met Tyr Ala Ser Val Leu Leu Leu Ala Ala Leu
115 120 125

Ser Ala Asp Leu Cys Phe Leu Ala Leu Gly Pro Ala Trp Trp Ser Thr
130 135 140

Val Gln Arg Ala Cys Gly Val Gln Val Ala Cys Gly Ala Ala Trp Thr
145 150 155 160

Leu Ala Leu Leu Leu Thr Val Pro Ser Ala Ile Tyr Arg Arg Leu His
165 170 175

Gln Glu His Phe Pro Ala Arg Leu Gln Cys Val Val Asp Tyr Gly Gly
180 185 190

Ser Ser Ser Thr Glu Asn Ala Val Thr Ala Ile Arg Phe Leu Phe Gly
195 200 205

Phe Leu Gly Pro Leu Val Ala Val Ala Ser Cys His Ser Ala Leu Leu
210 215 220

Cys Trp Ala Ala Arg Arg Cys Arg Pro Leu Gly Thr Ala Ile Val Val
225 230 235 240

Gly Phe Phe Val Cys Trp Ala Pro Tyr His Leu Leu Gly Leu Val Leu
245 250 255

Thr Val Ala Ala Pro Asn Ser Ala Leu Leu Ala Arg Ala Leu Arg Ala
260 265 270

Glu Pro Leu Ile Val Gly Leu Ala Leu Ala His Ser Cys Leu Asn Pro
275 280 285

Met Leu Phe Leu Tyr Phe Gly Arg Ala Gln Leu Arg Arg Ser Leu Pro
290 295 300

Ala Ala Cys His Trp Ala Leu Arg Glu Ser Gln Gly Gln Asp Glu Ser
305 310 315 320

Val Asp Ser Lys Lys Ser Thr Ser His Asp Leu Val Ser Glu Met Glu
325 330 335

Val

<210> 7

<211> 1272 

<212> DNA 

<213> Homo sapiens 






<400> 7

atgttgtgtc accgtggtgg ccagctgata gtgccaatca tcccactittg [illegible] 60

tcctgcaggg gtagaagact ccagaacctt ctctcaggcc [illegible] [illegible] 120

gaacttcata acctgagctc tccatctccc tctctctcct [illegible] [illegible] 180

ttctctccct caccctcctc tgctccctct gcctttacca [illegible] [illegible] 240

gggccctgcc accccacctc ttcctcgctg gtgtctgcct [illegible] [illegible] 300

ctggagtttg tcctgggcct ggtggggaac agtttggccc [illegible] [illegible] 360

acgcggccct ggacctccaa cacggtgttc ctggtcagcc [illegible] [illegible] 420

ctgatcagca acctgcccct ccgcgtggac tactacctcc [illegible] [illegible] 480

ggggctgctg cctgcaaagt caacctcttc atgctgtcca [illegible] [illegible] 540

gtcttcctca cagccatcgc actcaaccgc tacctgaagg [illegible] [illegible] 600

ctgagccgtg cttccgtggg ggcagctgcc cgggtggccg [illegible] [illegible] 660

ctgctcctca acgggcacct gctcctgagc accttctccg [illegible] [illegible] 720

agggtgggca cgaagccctc ggcctcgctc cgctggcacc [illegible] [illegible] 780

ttcttcctgc cactggcgct catcctcttt gctattgtga gcattgggct caccatccgg 840

aaccgtggtc tgggcgggca ggcaggcccg cagagggcca tgcgtgtgct ggccatggtg 900

gtggccgtct acaccatctg cttcttgccc agcatcatct ttggcatggc ttccatggtg 960

gctttctggc tgtccgcctg ccgatccctg gacctctgca cacagctctt ccatggctcc 1020

ctggccttca cctacctcaa cagtgtcctg gaccccgtgc tctactgctt ctctagcccc 1080

aacttcctcc accagagccg ggccttgctg ggcctcacgc ggggccggca gggcccagtg 1140

agcgacgaga gctcctacca accctccagg cagtggcgct accgggaggc ctctaggaag 1200

gcggaggcca tagggaagct gaaagtgcag ggcgaggtct ctctggaaaa ggaaggctcc 1260

tcccagggct ga 1272

<210> 8

<211> 423

<212> PRT

<213> Homo sapiens

<400> 8

Met Leu Cys His Arg Gly Gly Gln Leu Ile Val Pro Ile Ile Pro Leu
1 5 10 15

Cys Pro Glu His Ser Cys Arg Gly Arg Arg Leu Gln Asn Leu Leu Ser
20 25 30

Gly Pro Trp Pro Lys Gln Pro Met Glu Leu His Asn Leu Ser Ser Pro
35 40 45

Ser Pro Ser Leu Ser Ser Ser Val Leu Pro Pro Ser Phe Ser Pro Ser
50 55 60

Pro Ser Ser Ala Pro Ser Ala Phe Thr Thr Val Gly Gly Ser Ser Gly
65 70 75 80

Gly Pro Cys His Pro Thr Ser Ser Ser Leu Val Ser Ala Phe Leu Ala
85 90 95

Pro Ile Leu Ala Leu Glu Phe Val Leu Gly Leu Val Gly Asn Ser Leu
100 105 110

Ala Leu Phe Ile Phe Cys Ile His Thr Arg Pro Trp Thr Ser Asn Thr
115 120 125

Val Phe Leu Val Ser Leu Val Ala Ala Asp Phe Leu Leu Ile Ser Asn
130 135 140

Leu Pro Leu Arg Val Asp Tyr Tyr Leu Leu His Glu Thr Trp Arg Phe
145 150 155 160

Gly Ala Ala Ala Cys Lys Val Asn Leu Phe Met Leu Ser Thr Asn Arg
165 170 175

Thr Ala Ser Val Val Phe Leu Thr Ala Ile Ala Leu Asn Arg Tyr Leu
180 185 190

Lys Val Val Gln Pro His His Val Leu Ser Arg Ala Ser Val Gly Ala
195 200 205

Ala Ala Arg Val Ala Gly Gly Leu Trp Val Gly Ile Leu Leu Leu Asn
210 215 220

Gly His Leu Leu Leu Ser Thr Phe Ser Gly Pro Ser Cys Leu Ser Tyr
225 230 235 240

Arg Val Gly Thr Lys Pro Ser Ala Ser Leu Arg Trp His Gln Ala Leu
245 250 255

Tyr Leu Leu Glu Phe Phe Leu Pro Leu Ala Leu Ile Leu Phe Ala Ile
260 265 270

Val Ser Ile Gly Leu Thr Ile Arg Asn Arg Gly Leu Gly Gly Gln Ala
275 280 285

Gly Pro Gln Arg Ala Met Arg Val Leu Ala Met Val Val Ala Val Tyr
290 295 300

Thr Ile Cys Phe Leu Pro Ser Ile Ile Phe Gly Met Ala Ser Met Val
305 310 315 320

Ala Phe Trp Leu Ser Ala Cys Arg Ser Leu Asp Leu Cys Thr Gln Leu
325 330 335

Phe His Gly Ser Leu Ala Phe Thr Tyr Leu Asn Ser Val Leu Asp Pro
340 345 350

Val Leu Tyr Cys Phe Ser Ser Pro Asn Phe Leu His Gln Ser Arg Ala
355 360 365

Leu Leu Gly Leu Thr Arg Gly Arg Gln Gly Pro Val Ser Asp Glu Ser
370 375 380

Ser Tyr Gln Pro Ser Arg Gln Trp Arg Tyr Arg Glu Ala Ser Arg Lys
385 390 395 400

Ala Glu Ala Ile Gly Lys Leu Lys Val Gln Gly Glu Val Ser Leu Glu
405 410 415

Lys Glu Gly Ser Ser Gln Gly
420



<210> 9
<211> 966
<212> DNA
<213> Homo sapiens

<400> 9

atgaaccaga ctttgaatag cagtgggacc gtggagtcag ccctaaacta ttccagaggg 60

agcacagtgc acacggccta cctggtgctg agctccctgg ccatgttcac ctgcctgtgc 120

gggatggcag gcaacagcat ggtgatctgg ctgctgggct ttcgaatgca caggaacccc 180

ttctgcatct atatcctcaa cctggcggca gccgacctcc tcttcctctt cagcatggct 240

tccacgctca gcctggaaac ccagcccctg gtcaatacca ctgacaaggt ccacgagctg 300

atgaagagac tgatgtactt tgcctacaca gtgggcctga gcctgctgac ggccatcagc 360

acccagcgct gtctctctgt cctcttccct atctggttca agtgtcaccg gcccaggcac 420

ctgtcagcct gggtgtgtgg cctgctgtgg acactctgtc tcctgatgaa cgggttgacc 480

tcttccttct gcagcaagtt cttgaaattc aatgaagatc ggtgcttcag ggtggacatg 540

gtccaggccg ccctcatcat gggggtctta accccagtga tgactctgtc cagcctgacc 600

ctctttgtct gggtgcggag gagctcccag cagtggcggc ggcagcccac acggctgttc 660

gtggtggtcc tggcctctgt cctggtgttc ctcatctgtt ccctgcctct gagcatctac 720

tggtttgtgc tctactggtt gagcctgccg cccgagatgc aggtcctgtg cttcagcttg 780

tcacgcccct cctcgtccgt aagcagcagc gccaaccccg tcatctactt cctggtgggc 840

agccggagga gccacaggct gcccaccagg tccctgggga ctgtgctcca acaggcgctt 900

cgcgaggagc ccgagctgga aggtggggag acgcccaccg tgggcaccaa tgagatgggg 960

gcttga 966



<210> 10 

<211> 321

<212> PRT 

<213> Homo sapiens 

<400> 10 

Met Asn Gln Thr Leu Asn Ser Ser Gly Thr Val Glu Ser Ala Leu Asn
1 5 10 15

Tyr Ser Arg Gly Ser Thr Val His Thr Ala Tyr Leu Val Leu Ser Ser
20 25 30

Leu Ala Met Phe Thr Cys Leu Cys Gly Met Ala Gly Asn Ser Met Val
35 40 45

Ile Trp Leu Leu Gly Phe Arg Met His Arg Asn Pro Phe Cys Ile Tyr
50 55 60

Ile Leu Asn Leu Ala Ala Ala Asp Leu Leu Phe Leu Phe Ser Met Ala
65 70 75 80

Ser Thr Leu Ser Leu Glu Thr Gln Pro Leu Val Asn Thr Thr Asp Lys
85 90 95

Val His Glu Leu Met Lys Arg Leu Met Tyr Phe Ala Tyr Thr Val Gly
100 105 110

Leu Ser Leu Leu Thr Ala Ile Ser Thr Gln Arg Cys Leu Ser Val Leu
115 120 125

Phe Pro Ile Trp Phe Lys Cys His Arg Pro Arg His Leu Ser Ala Trp
130 135 140

Val Cys Gly Leu Leu Trp Thr Leu Cys Leu Leu Met Asn Gly Leu Thr
145 150 155 160

Ser Ser Phe Cys Ser Lys Phe Leu Lys Phe Asn Glu Asp Arg Cys Phe
165 170 175

Arg Val Asp Met Val Gln Ala Ala Leu Ile Met Gly Val Leu Thr Pro
180 185 190

Val Met Thr Leu Ser Ser Leu Thr Leu Phe Val Trp Val Arg Arg Ser
195 200 205

Ser Gln Gln Trp Arg Arg Gln Pro Thr Arg Leu Phe Val Val Val Leu
210 215 220

Ala Ser Val Leu Val Phe Leu Ile Cys Ser Leu Pro Leu Ser Ile Tyr
225 230 235 240

Trp Phe Val Leu Tyr Trp Leu Ser Leu Pro Pro Glu Met Gln Val Leu
245 250 255

Cys Phe Ser Leu Ser Arg Leu Ser Ser Ser Val Ser Ser Ser Ala Asn
260 265 270

Pro Val Ile Tyr Phe Leu Val Gly Ser Arg Arg Ser His Arg Leu Pro
275 280 285

Thr Arg Ser Leu Gly Thr Val Leu Gln Gln Ala Leu Arg Glu Glu Pro
290 295 300

Glu Leu Glu Gly Gly Glu Thr Pro Thr Val Gly Thr Asn Glu Met Gly
305 310 315 320 

Ala 



<210> 11 

<211> 1356 

<212> DNA 

<213> Homo sapiens 

<400> 11 

atggagtcct cacccatccc ccagtcatca gggaactctt ccactttggg gagggtccct 60

caaaccccag gtccctctac tgccagtggg gtcccggagg tggggctacg ggatgttgct 120

tcggaatctg tggccctctt cttcatgctc ctgctggact tgactgctgt ggctggcaat 180

gccgctgtga tggccgtgat cgccaagacg cctgccctcc gaaaatttgt cttcgtcttc 240

cacctctgcc tggtggacct gctggctgcc ctgaccctca tgcccctggc catgctctcc 300

agctctgccc tctttgacca cgccctcttt ggggaggtgg cctgccgcct ctacttgttt 360

ctgagcgtgt gctttgtcag cctggccatc ctctcggtgt cagccatcaa tgtggagcgc 420

tactattacg tagtccaccc catgcgctac gaggtgcgca tgacgctggg gctggtggcc 480

tctgtgctgg tgggtgtgtg ggtgaaggcc ttggccatgg cttctgtgcc agtgttggga 540

agggtctcct gggaggaagg agctcccagt gtccccccag gctgttcact ccagtggagc 600

cacagtgcct actgccagct ttttgtggtg gtctttgctg tcctttactt tctgttgccc 660

ctgctcctca tacttgtggt ctactgcagc atgttccgag tggcccgcgt ggctgccatg 720

cagcacgggc cgctgcccac gtggatggag acaccccggc aacgctccga atctctcagc 780

agccgctcca cgatggtcac cagctcgggg gccccccaga ccaccccaca ccggacgttt 840

gggggaggga aagcagcagt ggttctcctg gctgtggggg gacagttcct gctctgttgg 900

ttgccctact tctctttcca cctctatgtt gccctgagtg ctcagcccat ttcaactggg 960

caggtggaga gtgtggtcac ctggattggc tacttttgct tcacttccaa ccctttcttc 1020

tatggatgtc tcaaccggca gatccggggg gagctcagca agcagtttgt ctgcttcttc 1080

aagccagctc cagaggagga gctgaggctg cctagccggg agggctccat tgaggagaac 1140

ttcctgcagt tccttcaggg gactggctgt ccttctgagt cctgggtttc ccgaccccta 1200

cccagcccca agcaggagcc acctgctgtt gactttcgaa tcccaggcca gatagctgag 1260

gagacctctg agttcctgga gcagcaactc accagcgaca tcatcatgtc agacagctac 1320

ctccgtcctg ccgcctcacc ccggctggag tcatga 1356



<210> 12 

<211> 451 

<212> PRT 

<213> Homo sapiens 

<400> 12 

Met Glu Ser Ser Pro Ile Pro Gln Ser Ser Gly Asn Ser Ser Thr Leu
1 5 10 15

Gly Arg Val Pro Gln Thr Pro Gly Pro Ser Thr Ala Ser Gly Val Pro
20 25 30

Glu Val Gly Leu Arg Asp Val Ala Ser Glu Ser Val Ala Leu Phe Phe
35 40 45

Met Leu Leu Leu Asp Leu Thr Ala Val Ala Gly Asn Ala Ala Val Met
50 55 60

Ala Val Ile Ala Lys Thr Pro Ala Leu Arg Lys Phe Val Phe Val Phe
65 70 75 80

His Leu Cys Leu Val Asp Leu Leu Ala Ala Leu Thr Leu Met Pro Leu
85 90 95

Ala Met Leu Ser Ser Ser Ala Leu Phe Asp His Ala Leu Phe Gly Glu
100 105 110

Val Ala Cys Arg Leu Tyr Leu Phe Leu Ser Val Cys Phe Val Ser Leu
115 120 125

Ala Ile Leu Ser Val Ser Ala Ile Asn Val Glu Arg Tyr Tyr Tyr Val
130 135 140

Val His Pro Met Arg Tyr Glu Val Arg Met Thr Leu Gly Leu Val Ala
145 150 155 160

Ser Val Leu Val Gly Val Trp Val Lys Ala Leu Ala Met Ala Ser Val
165 170 175

Pro Val Leu Gly Arg Val Ser Trp Glu Glu Gly Ala Pro Ser Val Pro
180 185 190

Pro Gly Cys Ser Leu Gln Trp Ser His Ser Ala Tyr Cys Gln Leu Phe
195 200 205

Val Val Val Phe Ala Val Leu Tyr Phe Leu Leu Pro Leu Leu Leu Ile
210 215 220

Leu Val Val Tyr Cys Ser Met Phe Arg Val Ala Arg Val Ala Ala Met
225 230 235 240

Gln His Gly Pro Leu Pro Thr Trp Met Glu Thr Pro Arg Gln Arg Ser
245 250 255

Glu Ser Leu Ser Ser Arg Ser Thr Met Val Thr Ser Ser Gly Ala Pro
260 265 270

Gln Thr Thr Pro His Arg Thr Phe Gly Gly Gly Lys Ala Ala Val Val
275 280 285

Leu Leu Ala Val Gly Gly Gln Phe Leu Leu Cys Trp Leu Pro Tyr Phe
290 295 300

Ser Phe His Leu Tyr Val Ala Leu Ser Ala Gln Pro Ile Ser Thr Gly
305 310 315 320

Gln Val Glu Ser Val Val Thr Trp Ile Gly Tyr Phe Cys Phe Thr Ser
325 330 335

Asn Pro Phe Phe Tyr Gly Cys Leu Asn Arg Gln Ile Arg Gly Glu Leu
340 345 350

Ser Lys Gln Phe Val Cys Phe Phe Lys Pro Ala Pro Glu Glu Glu Leu
355 360 365

Arg Leu Pro Ser Arg Glu Gly Ser Ile Glu Glu Asn Phe Leu Gln Phe
370 375 380

Leu Gln Gly Thr Gly Cys Pro Ser Glu Ser Trp Val Ser Arg Pro Leu
385 390 395 400

Pro Ser Pro Lys Gln Glu Pro Pro Ala Val Asp Phe Arg Ile Pro Gly
405 410 415

Gln Ile Ala Glu Glu Thr Ser Glu Phe Leu Glu Gln Gln Leu Thr Ser
420 425 430

Asp Ile Ile Met Ser Asp Ser Tyr Leu Arg Pro Ala Ala Ser Pro Arg
435 440 445

Leu Glu Ser 
450 

<210> 13 

<211> 1041 

<212> DNA 

<213> Homo sapiens 

<400> 13 



atggagagaa aatttatgtc cttgcaacca tccatctccg tatcagaaat ggaaccaaat 60

ggcaccttca gcaataacaa cagcaggaac tgcacaattg aaaacttcaa gagagaattt 120

ttcccaattg tatatctgat aatatttttc tggggagtct tgggaaatgg gttgtccata 180

tatgttttcc tgcagcctta taagaagtcc acatctgtga acgttttcat gctaaatctg 240

gccatttcag atctcctgtt cataagcacg cttcccttca gggctgacta ttatcttaga 300

ggctccaatt ggatatttgg agacctggcc tgcaggatta tgtcttattc cttatatgtc 360

aacatgtaca gcagtattta tttcctgacc gtgctgagtg ttgtgcgttt cctggcaatg 420

gttcacccct ttcggcttct gcatgtcacc agcatcagga gtgcctggat cctctgtggg 480

atcatatgga tccttatcat ggcttcctca ataatgctcc tggacagtgg ctctgagcag 540

aacggcagtg tcacatcatg cttagagctg aatctctata aaattgctaa gctgcagacc 600

atgaactata ttgccttggt ggtgggctgc ctgctgccat ttttcacact cagcatctgt 660

tatctgctga tcattcgggt tctgttaaaa gtggaggtcc cagaatcggg gctgcgggtt 720

tctcacagga aggcactgac caccatcatc atcaccttga tcatcttctt cttgtgtttc 780

ctgccctatc acacactgag gaccgtccac ttgacgacat ggaaagtggg tttatgcaaa 840

gacagactgc ataaagcttt ggttatcaca ctggccttgg cagcagccaa tgcctgcttc 900

aatcctctgc tctattactt tgctggggag aattttaagg acagactaaa gtctgcactc 960

agaaaaggcc atccacagaa ggcaaagaca aagtgtgttt tccctgttag tgtgtggttg 1020

agaaaggaaa caagagtata a 1041



<210> 14 

<211> 346 

<212> PRT 

<213> Homo sapiens 

<400> 14 

Met Glu Arg Lys Phe Met Ser Leu Gln Pro Ser Ile Ser Val Ser Glu
1 5 10 15

Met Glu Pro Asn Gly Thr Phe Ser Asn Asn Asn Ser Arg Asn Cys Thr
20 25 30

Ile Glu Asn Phe Lys Arg Glu Phe Phe Pro Ile Val Tyr Leu Ile Ile
35 40 45

Phe Phe Trp Gly Val Leu Gly Asn Gly Leu Ser Ile Tyr Val Phe Leu
50 55 60

Gln Pro Tyr Lys Lys Ser Thr Ser Val Asn Val Phe Met Leu Asn Leu
65 70 75 80

Ala Ile Ser Asp Leu Leu Phe Ile Ser Thr Leu Pro Phe Arg Ala Asp
85 90 95

Tyr Tyr Leu Arg Gly Ser Asn Trp Ile Phe Gly Asp Leu Ala Cys Arg
100 105 110

Ile Met Ser Tyr Ser Leu Tyr Val Asn Met Tyr Ser Ser Ile Tyr Phe
115 120 125

Leu Thr Val Leu Ser Val Val Arg Phe Leu Ala Met Val His Pro Phe
130 135 140

Arg Leu Leu His Val Thr Ser Ile Arg Ser Ala Trp Ile Leu Cys Gly
145 150 155 160

Ile Ile Trp Ile Leu Ile Met Ala Ser Ser Ile Met Leu Leu Asp Ser
165 170 175

Gly Ser Glu Gln Asn Gly Ser Val Thr Ser Cys Leu Glu Leu Asn Leu
180 185 190

Tyr Lys Ile Ala Lys Leu Gln Thr Met Asn Tyr Ile Ala Leu Val Val
195 200 205

Gly Cys Leu Leu Pro Phe Phe Thr Leu Ser Ile Cys Tyr Leu Leu Ile
210 215 220

Ile Arg Val Leu Leu Lys Val Glu Val Pro Glu Ser Gly Leu Arg Val
225 230 235 240

Ser His Arg Lys Ala Leu Thr Thr Ile Ile Ile Thr Leu Ile Ile Phe
245 250 255

Phe Leu Cys Phe Leu Pro Tyr His Thr Leu Arg Thr Val His Leu Thr
260 265 270

Thr Trp Lys Val Gly Leu Cys Lys Asp Arg Leu His Lys Ala Leu Val
275 280 285

Ile Thr Leu Ala Leu Ala Ala Ala Asn Ala Cys Phe Asn Pro Leu Leu
290 295 300

Tyr Tyr Phe Ala Gly Glu Asn Phe Lys Asp Arg Leu Lys Ser Ala Leu
305 310 315 320

Arg Lys Gly His Pro Gln Lys Ala Lys Thr Lys Cys Val Phe Pro Val
325 330 335

Ser Val Trp Leu Arg Lys Glu Thr Arg Val 
340 345 

<210> 15 

<211> 1527 

<212> DNA 

<213> Homo sapiens

<400> 15

atgacgtcca cctgcaccaa cagcacgcgc gagagtaaca gcagccacac gtgcatgccc 60

ctctccaaaa tgcccatcag cctggcccac ggcatcatcc gctcaaccgt gctggttatc 120

ttcctcgccg cctctttcgt cggcaacata gtgctggcgc tagtgttgca gcgcaagccg 180

cagctgctgc aggtgaccaa ccgttttatc tttaacctcc tcgtcaccga cctgctgcag 240

atttcgctcg tggccccctg ggtggtggcc acctctgtgc ctctcttctg gcccctcaac 300

agccacttct gcacggccct ggttagcctc acccacctgt tcgccttcgc cagcgtcaac 360

accattgtcg tggtgtcagt ggatcgctac ttgtccatca tccaccctct ctcctacccg 420

tccaagatga cccagcgccg cggttacctg ctcctctatg gcacctggat tgtggccatc 480

ctgcagagca ctcctccact ctacggctgg ggccaggctg cctttgatga ycgwcLaugcL. 540

ctctgctcca tgatctgggg ggccagcccc agctacacta ttctcagcgt ggtgtccttc 600

atcgtcattc cactgattgt catgattgcc tgctactccg tgg tg ti-ccg X.gCoyCCCyy 660

aggcagcatg ctctgctgta caatgtcaag agacacagct tggaagtgcg agtcaaggac 720

tgtgtggaga atgaggatga agagggagca gagaagaagg aggagttcca ggatgagagt 780

gagtttcgcc gccagcatga aggtgaggtc aaggccaagg agggcagaat ggaagccaag 840

gacggcagcc tgaaggccaa ggaaggaagc acggggacca gtgagagtag tgtagaggcc 900

aggggcagcg aggaggtcag agagagcagc acggtggcca gcgacggcag catggagggt 960

aaggaaggca gcaccaaagt tgaggagaac agcatgaagg cagacaaggg tcgcacagag 1020

gtcaaccagt gcagcattga cttgggtgaa gatgacatgg agtttggtga agacgacatc 1080

aatttcagtg aggatgacgt cgaggcagtg aacatcccgg agagcctccc acccagtcgt 1140

cgtaacagca acagcaaccc tcctctgccc aggtgctacc agtgcaaagc tgctaaagtg 1200

atcttcatca tcattttctc ctatgtgcta tccctggggc cctactgctt tttagcagtc 1260

ctggccgtgt gggtggatgt cgaaacccag gtaccccagt gggtgatcac cataatcatc 1320

tggcttttct tcctgcagtg ctgcatccac ccctatgtct atggctacat gcacaagacc 1380

attaagaagg aaatccagga catgctgaag aagttcttct gcaaggaaaa gcccccgaaa 1440

gaagatagcc acccagacct gcccggaaca gagggtggga ctgaaggcaa gattgtccct 1500

tcctacgatt ctgctacttt tccttga 1527



<210> 16 

<211> 508 

<212> PRT 

<213> Homo sapiens 

<400> 16 

Met Thr Ser Thr Cys Thr Asn Ser Thr Arg Glu Ser Asn Ser Ser His
1 5 10 15

Thr Cys Met Pro Leu Ser Lys Met Pro Ile Ser Leu Ala His Gly Ile
20 25 30

Ile Arg Ser Thr Val Leu Val Ile Phe Leu Ala Ala Ser Phe Val Gly
35 40 45

Asn Ile Val Leu Ala Leu Val Leu Gln Arg Lys Pro Gln Leu Leu Gln
50 55 60

Val Thr Asn Arg Phe Ile Phe Asn Leu Leu Val Thr Asp Leu Leu Gln
65 70 75 80

Ile Ser Leu Val Ala Pro Trp Val Val Ala Thr Ser Val Pro Leu Phe
85 90 95

Trp Pro Leu Asn Ser His Phe Cys Thr Ala Leu Val Ser Leu Thr His
100 105 110

Leu Phe Ala Phe Ala Ser Val Asn Thr Ile Val Val Val Ser Val Asp
115 120 125

Arg Tyr Leu Ser Ile Ile His Pro Leu Ser Tyr Pro Ser Lys Met Thr
130 135 140

Gln Arg Arg Gly Tyr Leu Leu Leu Tyr Gly Thr Trp Ile Val Ala Ile
145 150 155 160

Leu Gln Ser Thr Pro Pro Leu Tyr Gly Trp Gly Gln Ala Ala Phe Asp
165 170 175

Glu Arg Asn Ala Leu Cys Ser Met Ile Trp Gly Ala Ser Pro Ser Tyr
180 185 190

Thr Ile Leu Ser Val Val Ser Phe Ile Val Ile Pro Leu Ile Val Met
195 200 205

Ile Ala Cys Tyr Ser Val Val Phe Cys Ala Ala Arg Arg Gln His Ala
210 215 220

Leu Leu Tyr Asn Val Lys Arg His Ser Leu Glu Val Arg Val Lys Asp
225 230 235 240

Cys Val Glu Asn Glu Asp Glu Glu Gly Ala Glu Lys Lys Glu Glu Phe
245 250 255

Gln Asp Glu Ser Glu Phe Arg Arg Gln His Glu Gly Glu Val Lys Ala
260 265 270

Lys Glu Gly Arg Met Glu Ala Lys Asp Gly Ser Leu Lys Ala Lys Glu
275 280 285

Gly Ser Thr Gly Thr Ser Glu Ser Ser Val Glu Ala Arg Gly Ser Glu
290 295 300

Glu Val Arg Glu Ser Ser Thr Val Ala Ser Asp Gly Ser Met Glu Gly
305 310 315 320

Lys Glu Gly Ser Thr Lys Val Glu Glu Asn Ser Met Lys Ala Asp Lys
325 330 335

Gly Arg Thr Glu Val Asn Gln Cys Ser Ile Asp Leu Gly Glu Asp Asp
340 345 350

Met Glu Phe Gly Glu Asp Asp Ile Asn Phe Ser Glu Asp Asp Val Glu
355 360 365

Ala Val Asn Ile Pro Glu Ser Leu Pro Pro Ser Arg Arg Asn Ser Asn
370 375 380

Ser Asn Pro Pro Leu Pro Arg Cys Tyr Gln Cys Lys Ala Ala Lys Val
385 390 395 400

Ile Phe Ile Ile Ile Phe Ser Tyr Val Leu Ser Leu Gly Pro Tyr Cys
405 410 415

Phe Leu Ala Val Leu Ala Val Trp Val Asp Val Glu Thr Gln Val Pro
420 425 430

Gln Trp Val Ile Thr Ile Ile Ile Trp Leu Phe Phe Leu Gln Cys Cys
435 440 445

Ile His Pro Tyr Val Tyr Gly Tyr Met His Lys Thr Ile Lys Lys Glu
450 455 460

Ile Gln Asp Met Leu Lys Lys Phe Phe Cys Lys Glu Lys Pro Pro Lys
465 470 475 480

Glu Asp Ser His Pro Asp Leu Pro Gly Thr Glu Gly Gly Thr Glu Gly
485 490 495

Lys Ile Val Pro Ser Tyr Asp Ser Ala Thr Phe Pro
500 505

<210> 17

<211> 1068

<212> DNA

<213> Homo sapiens

<400> 17

atgcccttga cgq<«c7TCjt ttcttcattt gaggacctct tggctaacaa tatcctcaga 60

atatttgtct gggLL.it a^- tttcattacc tgctttggaa atctttttgt cattggcatg 120

agatctttca ttaaagrtcu aaatacaact cacgctatgt ccatcaaaat cctttgttgc 180

gctgattgcc tgatgggtct ttacttgttc tttgttggca ttttcgatat aaaataccga 240

gggcagtatc agaactatgc cttgctgtgg atggagagcg tgcagtgccg cctcatgggg 300

ttcctggcca tgctgtccac ::gaagtctct gttctgctac tgacctactt gactttggag 360

aagttcctgg tcatrgtctt ccccttcagt aacattcgac ctggaaaacg gcagacctca 420

gtcatcctca tttgcatctg gatggcggga tttttaatag ctgtaattcc attttggaat 480

aaggattatt ttggaaactt ttatgggaaa aatggagtat gtttcccact ttattatgac 540

caaacagaag atattggaag caaagggtat tctcttggaa ttttcctagg tgtgaacttg 600

ctggcttttc tcatcattgt actttcctat attactatgt tctgttccat tcaaaaaacc 660

gccttgcaga ccacagaagt aaggaattgt tttggaagag aggtggctgt tgcaaatcgt 720

ttctttttta tagtgttctc tgatgccatc tgctggattc ctgtatttgt agttaaaatc 780

ctttccctct tccgggtgga aataccagac acaatgactt cctggatagt gatttttttc 840

cttccagtta acagtgcttt gaatccaatc ctctatactc tcacaaccaa cttttttaag 900

gacaagttga aacagctgct gcacaaacat cagaggaaat caattttcaa aattaaaaaa 960

aaaagtttat ctacatccat tgtgtggata gaggactcct cttccctgaa acttggggtt 1020

ttgaacaaaa taacacttgg agacagtata atgaaaccag tttcctag 1068



<210> 18 
<211> 355 
<212> PRT 
<213> Homo sapiens

<400> 18

Met Pro Leu Thr Asp Gly Ile Ser Ser Phe Glu Asp Leu Leu Ala Asn
1 5 10 15

Asn Ile Leu Arg Ile Phe Val Trp Val Ile Ala Phe Ile Thr Cys Phe
20 25 30

Gly Asn Leu Phe Val Ile Gly Met Arg Ser Phe Ile Lys Ala Glu Asn
35 40 45

Thr Thr His Ala Met Ser Ile Lys Ile Leu Cys Cys Ala Asp Cys Leu
50 55 60

Met Gly Val Tyr Leu Phe Phe Val Gly Ile Phe Asp Ile Lys Tyr Arg
65 70 75 80

Gly Gln Tyr Gln Lys Tyr Ala Leu Leu Trp Met Glu Ser Val Gln Cys
85 90 95

Arg Leu Met Gly Phe Leu Ala Met Leu Ser Thr Glu Val Ser Val Leu
100 105 110

Leu Leu Thr Tyr Leu Thr Leu Glu Lys Phe Leu Val Ile Val Phe Pro
115 120 125

Phe Ser Asn Ile Arg Pro Gly Lys Arg Gln Thr Ser Val Ile Leu Ile
130 135 140

Cys Ile Trp Met Ala Gly Phe Leu Ile Ala Val Ile Pro Phe Trp Asn
145 150 155 160

Lys Asp Tyr Phe Gly Asn Phe Tyr Gly Lys Asn Gly Val Cys Phe Pro
165 170 175

Leu Tyr Tyr Asp Gln Thr Glu Asp Ile Gly Ser Lys Gly Tyr Ser Leu
180 185 190

Gly Ile Phe Leu Gly Val Asn Leu Leu Ala Phe Leu Ile Ile Val Phe
195 200 205

Ser Tyr Ile Thr Met Phe Cys Ser Ile Gln Lys Thr Ala Leu Gln Thr
210 215 220

Thr Glu Val Arg Asn Cys Phe Gly Arg Glu Val Ala Val Ala Asn Arg
225 230 235 240

Phe Phe Phe Ile Val Phe Ser Asp Ala Ile Cys Trp Ile Pro Val Phe
245 250 255

Val Val Lys Ile Leu Ser Leu Phe Arg Val Glu Ile Pro Asp Thr Met
260 265 270

Thr Ser Trp Ile Val Ile Phe Phe Leu Pro Val Asn Ser Ala Leu Asn
275 280 285

Pro Ile Leu Tyr Thr Leu Thr Thr Asn Phe Phe Lys Asp Lys Leu Lys
290 295 300

Gln Leu Leu His Lys His Gln Arg Lys Ser Ile Phe Lys Ile Lys Lys
305 310 315 320

Lys Ser Leu Ser Thr Ser Ile Val Trp Ile Glu Asp Ser Ser Ser Leu
325 330 335

Lys Leu Gly Val Leu Asn Lys Ile Thr Leu Gly Asp Ser Ile Met Lys
340 345 350

Pro Val Ser
355



<210> 19 

<211> 969 

<212> DNA 

<213> Homo sapiens 












<400> 19 
atggatccaa 


ccatctcaac 


cttocracaca 

Vp* *— ^ y y CI S^ O 




^ctd Lv_aaOgg 


S ^ ^ +" s ^ y*T a 

aoL.i.gaggag 


bU 


actctttgct 


acaagcagac 


cttgagcctc 


acggtgc tga 






J. ^ u 


gggctgacag 


gaaacgcagt 


tqtQCtctaa 


ctcctgggct 






X o u 


ttctccatct 


acatcctcaa 


cttoonnor'A 

W ^ W ^ ^ k-- KJ 






^ /*T ^ ^~ 

CygcCyCCUL. 


o A n 


atatattccc tgttaagctt catcagtatc ccccatacca tctctaaaat cctctatcct 300

gtgatgatgt tttcctactt tgcaggcctg agctttctga gtgccgtgag caccgagcgc 360

tgcctgtccg tcctgtggcc catctggtac cgctgccacc gccccacaca cctgtcagcg 420

gtggtgtgtg tcctgctctg ggccctgtcc ctgctgcgga gcatcctgga gtggatgtta 480

tgtggcttcc tgttcagtgg tgctgattct gcttggtgtc aaacatcaga tttcatcaca 540

gtcgcgtggc tgattttttt atgtgtggtt ctctgtgggt ccagcctggt cctgctgatc 600

aggattctct gtggatcccg gaagataccg ctgaccaggc tgtacgtgac catcctgctc 660

acagtactgg tcttcctcct ctgtggcctg ccctttggca ttcagttttt cctattttta 720

tggatccacg tggacaggga agtcttattt tgtcatgttc atctagtttc tattttcctg 780

tccgctctta acagcagtgc caaccccatc atttacttct tcgtgggctc ctttaggcag 840

cgtcaaaata ggcagaacct gaagctggtt ctccagaggg ctctgcagga cgcgtctgag 900

gtggatgaag gtggagggca gcttcctgag gaaatcctgg agctgtcggg aagcagattg 960

gagcagtga 969



<210> 20 

<211> 322 

<212> PRT 

<213> Homo sapiens 

<400> 20

Met Asp Pro Thr Ile Ser Thr Leu Asp Thr Glu Leu Thr Pro Ile Asn
1 5 10 15

Gly Thr Glu Glu Thr Leu Cys Tyr Lys Gln Thr Leu Ser Leu Thr Val
20 25 30

Leu Thr Cys Ile Val Ser Leu Val Gly Leu Thr Gly Asn Ala Val Val
35 40 45

Leu Trp Leu Leu Gly Cys Arg Met Arg Arg Asn Ala Phe Ser Ile Tyr
50 55 60

Ile Leu Asn Leu Ala Ala Ala Asp Phe Leu Phe Leu Ser Gly Arg Leu
65 70 75 80

Ile Tyr Ser Leu Leu Ser Phe Ile Ser Ile Pro His Thr Ile Ser Lys
85 90 95

Ile Leu Tyr Pro Val Met Met Phe Ser Tyr Phe Ala Gly Leu Ser Phe
100 105 110

Leu Ser Ala Val Ser Thr Glu Arg Cys Leu Ser Val Leu Trp Pro Ile
115 120 125

Trp Tyr Arg Cys His Arg Pro Thr His Leu Ser Ala Val Val Cys Val
130 135 140

Leu Leu Trp Ala Leu Ser Leu Leu Arg Ser Ile Leu Glu Trp Met Leu
145 150 155 160

Cys Gly Phe Leu Phe Ser Gly Ala Asp Ser Ala Trp Cys Gln Thr Ser
165 170 175

Asp Phe Ile Thr Val Ala Trp Leu Ile Phe Leu Cys Val Val Leu Cys
180 185 190

Gly Ser Ser Leu Val Leu Leu Ile Arg Ile Leu Cys Gly Ser Arg Lys
195 200 205

Ile Pro Leu Thr Arg Leu Tyr Val Thr Ile Leu Leu Thr Val Leu Val
210 215 220

Phe Leu Leu Cys Gly Leu Pro Phe Gly Ile Gln Phe Phe Leu Phe Leu
225 230 235 240

Trp Ile His Val Asp Arg Glu Val Leu Phe Cys His Val His Leu Val
245 250 255

Ser Ile Phe Leu Ser Ala Leu Asn Ser Ser Ala Asn Pro Ile Ile Tyr
260 265 270

Phe Phe Val Gly Ser Phe Arg Gln Arg Gln Asn Arg Gln Asn Leu Lys
275 280 285

Leu Val Leu Gln Arg Ala Leu Gln Asp Ala Ser Glu Val Asp Glu Gly
290 295 300

Gly Gly Gln Leu Pro Glu Glu Ile Leu Glu Leu Ser Gly Ser Arg Leu
305 310 315 320

Glu Gln



<210> 21

<211> 1305

<212> DNA

<213> Homo sapiens

<400> 21

atggaggatc tctttagccc ctcaattctg ccgccggcgc ccaacatttc cgtgcccatc 60

ttgctgggct ggggtctcaa cctgaccttg gggcaaggag cccctgcctc tgggccgccc 120

agccgccgcg tccgcctggt gttcctgggg gtcatcctgg tggtggcggt ggcaggcaac 180

accacagtgc tgtgccgcct gtgcggcggc ggcgggccct gggcgggccc caagcgtcgc 240

aagatggact tcctgctggt gcagctggcc ctggcggacc tgtacgcgtg cgggggcacg 300

gcgctgtcac agctggcctg ggaactgctg ggcgagcccc gcgcggccac gggggacctg 360

gcgtgccgct tcctgcagct gctgcaggca tccgggcggg gcgcctcggc ccacctcgtg 420

gtgctcatcg ccctcgagcg ccggcgcgcg gtgcgtcttc cgcacggccg gccgctgccc 480

gcgcgtgccc tcgccgccct gggctggctg ctggcactgc tgctggcgct gcccccggcc 540

crcgtggtgc gcggggactc cccctcgccg ctgccgccgc cgccgccgcc aacgtccctg 600

cagccoggcg cgcccccggc cgcccgcgcc tggccggggg agcgtcgctg ccacgggatc 660

ttcgcgcccc cgccgcgctg gcacctgcag gtctacgcgt tctacgaggc cgtcgcgggc 720

ttcgtcgcgc ctgttacggt cctgggcgtc gcttgcggcc acctactctc cgtctggtgg 780

cggcaccggc cqccjogcccc cgcggctgca gcgccctggt cggcgagccc aggtcgagcc 840

cctgcgccca ccgcgctgcc ccgcgccaag gtgcagagcc tgaagatgag cctgctgctg 900

gcgctgctgt tcctgggctg cgagctgccc tactttgccg cccggctggc ggccgcgtgg 960

tcgcccgggc ccgcgggaga ctgggaggga gagggcctgt cggcggcgct gcgcgtggtg 1020

gcgatggcca acagcgctct caatcccttc gtctacctct tcttccaggc gggcgactgc 1080

cggctccggc gdcagctgcg gaagcggctg ggctctctgt gctgcgcgcc gcagggaggc 1140

gcggaggacg aggaggggcc ccggggccac caggcgctct accgccaacg ctggccccac 1200

cctcattatc accatgctcg gcgggaaccg ctggacgagg gcggcttgcg cccaccccct 1260

ccgcgcccca gacccctgcc ttgctcctgc gaaagtgcct tctag 1305



<210> 22 

<211> 434 

<212> PRT

<213> Homo sapiens

<400> 22

Met Glu Asp Leu Phe Ser Pro Ser Ile Leu Pro Pro Ala Pro Asn Ile
1 5 10 15

Ser Val Pro Ile Leu Leu Gly Trp Gly Leu Asn Leu Thr Leu Gly Gln
20 25 30

Gly Ala Pro Ala Ser Gly Pro Pro Ser Arg Arg Val Arg Leu Val Phe
35 40 45

Leu Gly Val Ile Leu Val Val Ala Val Ala Gly Asn Thr Thr Val Leu
50 55 60

Cys Arg Leu Cys Gly Gly Gly Gly Pro Trp Ala Gly Pro Lys Arg Arg
65 70 75 80

Lys Met Asp Phe Leu Leu Val Gln Leu Ala Leu Ala Asp Leu Tyr Ala
85 90 95

Cys Gly Gly Thr Ala Leu Ser Gln Leu Ala Trp Glu Leu Leu Gly Glu
100 105 110

Pro Arg Ala Ala Thr Gly Asp Leu Ala Cys Arg Phe Leu Gln Leu Leu
115 120 125

Gln Ala Ser Gly Arg Gly Ala Ser Ala His Leu Val Val Leu Ile Ala
130 135 140

Leu Glu Arg Arg Arg Ala Val Arg Leu Pro His Gly Arg Pro Leu Pro
145 150 155 160

Ala Arg Ala Leu Ala Ala Leu Gly Trp Leu Leu Ala Leu Leu Leu Ala
165 170 175

Leu Pro Pro Ala Phe Val Val Arg Gly Asp Ser Pro Ser Pro Leu Pro
180 185 190

Pro Pro Pro Pro Pro Thr Ser Leu Gln Pro Gly Ala Pro Pro Ala Ala
195 200 205

Arg Ala Trp Pro Gly Glu Arg Arg Cys His Gly Ile Phe Ala Pro Leu
210 215 220

Pro Arg Trp His Leu Gln Val Tyr Ala Phe Tyr Glu Ala Val Ala Gly
225 230 235 240

Phe Val Ala Pro Val Thr Val Leu Gly Val Ala Cys Gly His Leu Leu
245 250 255

Ser Val Trp Trp Arg His Arg Pro Gln Ala Pro Ala Ala Ala Ala Pro
260 265 270

Trp Ser Ala Ser Pro Gly Arg Ala Pro Ala Pro Ser Ala Leu Pro Arg
275 280 285

Ala Lys Val Gln Ser Leu Lys Met Ser Leu Leu Leu Ala Leu Leu Phe
290 295 300

Val Gly Cys Glu Leu Pro Tyr Phe Ala Ala Arg Leu Ala Ala Ala Trp
305 310 315 320

Ser Ser Gly Pro Ala Gly Asp Trp Glu Gly Glu Gly Leu Ser Ala Ala
325 330 335

Leu Arg Val Val Ala Met Ala Asn Ser Ala Leu Asn Pro Phe Val Tyr
340 345 350

Leu Phe Phe Gln Ala Gly Asp Cys Arg Leu Arg Arg Gln Leu Arg Lys
355 360 365

Arg Leu Gly Ser Leu Cys Cys Ala Pro Gln Gly Gly Ala Glu Asp Glu
370 375 380

Glu Gly Pro Arg Gly His Gln Ala Leu Tyr Arg Gln Arg Trp Pro His
385 390 395 400


Pro His Tyr His His Ala Arg Arg Glu Pro Leu Asp Glu Gly Gly Leu 
405 410 415

Arg Pro Pro Pro Pro Arg Pro Arg Pro Leu Pro Cys Ser Cys Glu Ser 
420 425 430 

Ala Phe 



<210> 23 

<211> 1041 

<212> DNA 

<213> Homo sapiens 

<400> 23 



atgtacaacg ggtcgtgctg ccgcatcgag ggggacacca tctcccaggt gatgccgccg 60

ctgctcattg tggcctttgt gctgggcgca ctaggcaatg gggtcgccct gtgtggtttc 120

tgcttccaca tgaagacctg gaagcccagc actgtttacc ttttcaattt ggccgtggct 180

gatttcctcc ttatgatctg cctgcctttt cggacagact attacctcag acgtagacac 240

tgggcttttg gggacattcc ctgccgagtg gggctcttca cgttggccat gaacagggcc 300

gggagcatcg tgttccttac [illegible] gcggacaggt atttcaaagt ggtccacccc 360

caccacgcgg tgaacactat ctccacccgg gtggcggctg gcatcgtctg caccctgtgg 420

gccctggtca tcctgggaac agtgtatctt ttgctggaga accatctctg cgtgcaagag 480

acggccgtct cctgtgagag cttcatcatg gagtcggcca atggctggca tgacatcatg 540

ttccagctgg agttctttat gcccctcggc atcatcttat tttgctcctt caagattgtt 600

tggagcctga ggcggaggca gcagctggcc agacaggctc ggatgaagaa ggcgacccgg 660

ttcatcatgg tggtggcaat tgtgttcatc acatgctacc tgcccagcgt gtctgctaga 720

ctctatttcc tctggacggt gccctcgagt gcctgcgatc cctctgtcca tggggccctg 780

cacataaccc tcagcttcac ctacatgaac agcatgctgg atcccctggt gtattatttt 840

tcaagcccct cctttcccaa attctacaac aagctcaaaa tctgcagtct gaaacccaag 900

cagccaggac actcaaaaac acaaaggccg gaagagatgc caatttcgaa cctcggtcgc 960

aggagttgca tcagtgtggc aaatagtttc caaagccagt ctgatgggca atgggatccc 1020

cacattgttg agtggcactg a 1041



<210> 24 

<211> 346 

<212> PRT 

<213> Homo sapiens 

<400> 24 

Met Tyr Asn Gly Ser Cys Cys Arg Ile Glu Gly Asp Thr Ile Ser Gln
1 5 10 15

Val Met Pro Pro Leu Leu Ile Val Ala Phe Val Leu Gly Ala Leu Gly
20 25 30

Asn Gly Val Ala Leu Cys Gly Phe Cys Phe His Met Lys Thr Trp Lys
35 40 45

Pro Ser Thr Val Tyr Leu Phe Asn Leu Ala Val Ala Asp Phe Leu Leu
50 55 60

Met Ile Cys Leu Pro Phe Arg Thr Asp Tyr Tyr Leu Arg Arg Arg His
65 70 75 80

Trp Ala Phe Gly Asp Ile Pro Cys Arg Val Gly Leu Phe Thr Leu Ala
85 90 95

Met Asn Arg Ala Gly Ser Ile Val Phe Leu Thr Val Val Ala Ala Asp
100 105 110

Arg Tyr Phe Lys Val Val His Pro His His Ala Val Asn Thr Ile Ser
115 120 125

Thr Arg Val Ala Ala Gly Ile Val Cys Thr Leu Trp Ala Leu Val Ile
130 135 140

Leu Gly Thr Val Tyr Leu Leu Leu Glu Asn His Leu Cys Val Gln Glu
145 150 155 160

Thr Ala Val Ser Cys Glu Ser Phe Ile Met Glu Ser Ala Asn Gly Trp
165 170 175

His Asp Ile Met Phe Gln Leu Glu Phe Phe Met Pro Leu Gly Ile Ile
180 185 190

Leu Phe Cys Ser Phe Lys Ile Val Trp Ser Leu Arg Arg Arg Gln Gln
195 200 205

Leu Ala Arg Gln Ala Arg Met Lys Lys Ala Thr Arg Phe Ile Met Val
210 215 220

Val Ala Ile Val Phe Ile Thr Cys Tyr Leu Pro Ser Val Ser Ala Arg
225 230 235 240

Leu Tyr Phe Leu Trp Thr Val Pro Ser Ser Ala Cys Asp Pro Ser Val
245 250 255

His Gly Ala Leu His Ile Thr Leu Ser Phe Thr Tyr Met Asn Ser Met
260 265 270

Leu Asp Pro Leu Val Tyr Tyr Phe Ser Ser Pro Ser Phe Pro Lys Phe
275 280 285

Tyr Asn Lys Leu Lys Ile Cys Ser Leu Lys Pro Lys Gln Pro Gly His
290 295 300

Ser Lys Thr Gln Arg Pro Glu Glu Met Pro Ile Ser Asn Leu Gly Arg
305 310 315 320

Arg Ser Cys Ile Ser Val Ala Asn Ser Phe Gln Ser Gln Ser Asp Gly
325 330 335

Gln Trp Asp Pro His Ile Val Glu Trp His
340 345

<210> 25 

<211> 1011 

<212> DNA 

<213> Homo sapiens 




<400> 25 
atgaacaaca 


atacaacatg 


tattcaacca 


\f o ^ \^ u \r 


A" '\' r"" f" za. t- ry 


ttt-dccaauc 


60 


attitacat cc 


tcctttgtat 


tgttggtgtt 


^ y y o a ci ^— o 




d uyydudtL-L. 




ttaacaaaaa 


taggtaaaaa 


aacatcaacg 


cacatctacc 




L.y LydCL.gv_.d 


ion 


aacttacttg 


t at a c a Q t Q c 


catgcctttc 


ataacrtatct 


u^- ^^\vk.yoa 


dyy u V, 1-L.Cdd 




tgggaatatc 


aatctgctca 


at crcaaaata 


gtcaattttc 




d UL.L.d uy^dL. 




gcaagtatgt 


ttgtcagtct 


ct taa 1 1 tta 






ctatgctacc 


T C A 


ttaatgcaaa 


aggattcctc 


y ^-i* u ^ ^ w ^i,i> 




d uydydddd L, 


attttatggc 




ca t tt act ga 


aaaaatttcg 






a a ^ ^ a ^ ^ s ^ 

ddCCdLyCdU 


L.t.acauaT_gg 


4 bO 




+• ff ^ 1~ a 3 +" 


a t~ "t~ 3 ^ 

L-auuccoytt. 


accgtatact 


actcagtcat 


agaggctaca 


540 




a^d^u.^ L.CI 




cagatggaac 


taggagccat 


gatct ctcag 


600 








gyatuutcct 


uuTiL.agtiagL. 


actaacatca 


660 










gtacgtccat 


tatggagaaa 


720 


gatttgactt acagttctgt gaaaagacat cttttggtca tccagattct actaatagtt 780

tgcttccttc cttatagtat ttttaaaccc attttttatg ttctacacca aagagataac 840

tgtcagcaat tgaattattt aatagaaaca aaaaacattc tcacctgtct tgcttcggcc 900

agaagtagca cagaccccat tatatttctt ttattagata aaacattcaa gaagacacta 960

tataatctct ttacaaagtc taattcagca catatgcaat catatggttg a 1011



<210> 26 

<211> 336 

<212> PRT 

<213> Homo sapiens 

<400> 26 

Met Asn Asn Asn Thr Thr Cys Ile Gln Pro Ser Met Ile Ser Ser Met
1 5 10 15

Ala Leu Pro Ile Ile Tyr Ile Leu Leu Cys Ile Val Gly Val Phe Gly
20 25 30

Asn Thr Leu Ser Gln Trp Ile Phe Leu Thr Lys Ile Gly Lys Lys Thr
35 40 45

Ser Thr His Ile Tyr Leu Ser His Leu Val Thr Ala Asn Leu Leu Val
50 55 60

Cys Ser Ala Met Pro Phe Met Ser Ile Tyr Phe Leu Lys Gly Phe Gln
65 70 75 80

Trp Glu Tyr Gln Ser Ala Gln Cys Arg Val Val Asn Phe Leu Gly Thr
85 90 95

Leu Ser Met His Ala Ser Met Phe Val Ser Leu Leu Ile Leu Ser Trp
100 105 110

Ile Ala Ile Ser Arg Tyr Ala Thr Leu Met Gln Lys Asp Ser Ser Gln
115 120 125

Glu Thr Thr Ser Cys Tyr Glu Lys Ile Phe Tyr Gly His Leu Leu Lys
130 135 140

Lys Phe Arg Gln Pro Asn Phe Ala Arg Lys Leu Cys Ile Tyr Ile Trp
145 150 155 160

Gly Val Val Leu Gly Ile Ile Ile Pro Val Thr Val Tyr Tyr Ser Val
165 170 175

Ile Glu Ala Thr Glu Gly Glu Glu Ser Leu Cys Tyr Asn Arg Gln Met
180 185 190

Glu Leu Gly Ala Met Ile Ser Gln Ile Ala Gly Leu Ile Gly Thr Thr
195 200 205

Phe Ile Gly Phe Ser Phe Leu Val Val Leu Thr Ser Tyr Tyr Ser Phe
210 215 220

Val Ser His Leu Arg Lys Ile Arg Thr Cys Thr Ser Ile Met Glu Lys
225 230 235 240

Asp Leu Thr Tyr Ser Ser Val Lys Arg His Leu Leu Val Ile Gln Ile
245 250 255

Leu Leu Ile Val Cys Phe Leu Pro Tyr Ser Ile Phe Lys Pro Ile Phe
260 265 270

Tyr Val Leu His Gln Arg Asp Asn Cys Gln Gln Leu Asn Tyr Leu Ile
275 280 285

Glu Thr Lys Asn Ile Leu Thr Cys Leu Ala Ser Ala Arg Ser Ser Thr
290 295 300

Asp Pro Ile Ile Phe Leu Leu Leu Asp Lys Thr Phe Lys Lys Thr Leu
305 310 315 320

Tyr Asn Leu Phe Thr Lys Ser Asn Ser Ala His Met Gln Ser Tyr Gly
325 330 335




<210> 27

<211> 1014

<212> DNA

<213> Homo sapiens

<400> 27

atgaatgagc cactagacta tttagcaaat gcttctgatt tccccgatta tgcagctgct 60

tttggaaatt gcactgatga aaacatccca ctcaagatgc actacctccc tgttatttat 120

ggcattatct tcctcgtggg atttccaggc aatgcagtag tgatatccac ttacattttc 180

aaaatgagac cttggaagag cagcaccatc attatgctga acctggcctg cacagatctg 240

ctgtatctga ccagcctccc cttcctgatt cactactatg ccagtggcga aaactggatc 300

tttggagatt tcatgtgtaa gtttatccgc ttcagcttcc atttcaacct gtatagcagc 360

atcctcttcc tcacctgttt cagcatcttc cgctactgtg tgatcattca cccaatgagc 420

tgcttttcca ttcacaaaac tcgatgtgca gttgtagcct gtgctgtggt gtggatcatt 480

tcactggtag ctgtcattcc gatgaccttc ttgatcacat caaccaacag gaccaacaga 540

tcagcctgtc tcgacctcac cagttcggat gaactcaata ctattaagtg gtacaacctg 600

attttgactg caactacttt ctgcctcccc ttggtgatag tgacactttg ctataccacg 660

attatccaca ctctgaccca tggactgcaa actgacagct gccttaagca gaaagcacga 720

aggctaacca ttctgctact ccttgcattt tacgtatgtt ttttaccctt ccatatcttg 780

agggtcattc ggatcgaatc tcgcctgctt tcaatcagtt gttccattga gaatcagatc 840

catgaagctt acatcgtttc tagaccatta gctgctctga acacctttgg taacctgtta 900

ctatatgtgg tggtcagcga caactttcag caggctgtct gctcaacagt gagatgcaaa 960

gtaagcggga accttgagca agcaaagaaa attagttact caaacaaccc ttga 1014



<210> 28 

<211> 337 

<212> PRT 

<213> Homo sapiens 

<400> 28 

Met Asn Glu Pro Leu Asp Tyr Leu Ala Asn Ala Ser Asp Phe Pro Asp
1 5 10 15

Tyr Ala Ala Ala Phe Gly Asn Cys Thr Asp Glu Asn Ile Pro Leu Lys
20 25 30

Met His Tyr Leu Pro Val Ile Tyr Gly Ile Ile Phe Leu Val Gly Phe
35 40 45

Pro Gly Asn Ala Val Val Ile Ser Thr Tyr Ile Phe Lys Met Arg Pro
50 55 60

Trp Lys Ser Ser Thr Ile Ile Met Leu Asn Leu Ala Cys Thr Asp Leu
65 70 75 80

Leu Tyr Leu Thr Ser Leu Pro Phe Leu Ile His Tyr Tyr Ala Ser Gly
85 90 95

Glu Asn Trp Ile Phe Gly Asp Phe Met Cys Lys Phe Ile Arg Phe Ser
100 105 110

Phe His Phe Asn Leu Tyr Ser Ser Ile Leu Phe Leu Thr Cys Phe Ser
115 120 125

Ile Phe Arg Tyr Cys Val Ile Ile His Pro Met Ser Cys Phe Ser Ile
130 135 140

His Lys Thr Arg Cys Ala Val Val Ala Cys Ala Val Val Trp Ile Ile
145 150 155 160

Ser Leu Val Ala Val Ile Pro Met Thr Phe Leu Ile Thr Ser Thr Asn
165 170 175

Arg Thr Asn Arg Ser Ala Cys Leu Asp Leu Thr Ser Ser Asp Glu Leu
180 185 190

Asn Thr Ile Lys Trp Tyr Asn Leu Ile Leu Thr Ala Thr Thr Phe Cys
195 200 205

Leu Pro Leu Val Ile Val Thr Leu Cys Tyr Thr Thr Ile Ile His Thr
210 215 220

Leu Thr His Gly Leu Gln Thr Asp Ser Cys Leu Lys Gln Lys Ala Arg
225 230 235 240

Arg Leu Thr Ile Leu Leu Leu Leu Ala Phe Tyr Val Cys Phe Leu Pro
245 250 255

Phe His Ile Leu Arg Val Ile Arg Ile Glu Ser Arg Leu Leu Ser Ile
260 265 270

Ser Cys Ser Ile Glu Asn Gln Ile His Glu Ala Tyr Ile Val Ser Arg
275 280 285

Pro Leu Ala Ala Leu Asn Thr Phe Gly Asn Leu Leu Leu Tyr Val Val
290 295 300

Val Ser Asp Asn Phe Gln Gln Ala Val Cys Ser Thr Val Arg Cys Lys
305 310 315 320

Val Ser Gly Asn Leu Glu Gln Ala Lys Lys Ile Ser Tyr Ser Asn Asn
325 330 335

Pro

<210> 29 

<211> 993 

<212> DNA 

<213> Homo sapiens 

<400> 29 

atggatccaa ccaccccggc ctggggaaca gaaagtacaa cagtgaatgg aaatgaccaa 60 

gcccttcttc tgctttgtgg caaggagacc ctgatcccgg tcttcctgat ccttttcatt 120 

gccctggtcg ggctggtagg aaacgggttt gtgctctggc tcctgggctt ccgcatgcgc 180 

aggaacgcct tctctgtcta cgtcctcagc ctggccgggg ccgacttcct cttcctctgc 240 

ttccagatta taaattgcct ggtgtacctc agtaacttct tctgttccat ctccatcaat 300 

ttccctagct tcttcaccac tgtgatgacc tgtgcctacc ttgcaggcct gagcatgctg 360 

agcaccgtca gcaccgagcg ctgcctgtcc gtcctgtggc ccatctggta tcgctgccgc 420 

cgccccagac acctgtcagc ggtcgtgtgt gtcctgctct gggccctgtc cctactgctg 480 

agcatcttgg aagggaagtt ctgtggcttc ttatttagtg atggtgactc tggttggtgt 540 

cagacatttg atttcatcac tgcagcgtgg ctgatttttt tattcatggt tctctgtggg 600 

tccagtctgg ccctgctggt caggatcctc tgtggctcca ggggtctgcc actgaccagg 660 

ctgtacctga ccatcctgct cacagtgctg gtgttcctcc tctgcggcct gccctttggc 720 

attcagtggt tcctaatatt atggatctgg aaggattctg atgtcttatt ttgtcatatt 780 

catccagttt cagttgtcct gtcatctctt aacagcagtg ccaaccccat catttacttc 840 

ttcgtgggct cttttaggaa gcagtggcgg ctgcagcagc cgatcctcaa gctggctctc 900 

cagagggctc tgcaggacat tgctgaggtg gatcacagtg aaggatgctt ccgtcagggc 960 

accccggaga tgtcgagaag cagtctggtg tag 993 
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<210> 30 

<211> 330 

<212> PRT 

<213> Homo sapiens 

<400> 30 

Met Asp Pro Thr Thr Pro Ala Trp Gly Thr Glu Ser Thr Thr Val Asn
1 5 10 15

Gly Asn Asp Gln Ala Leu Leu Leu Leu Cys Gly Lys Glu Thr Leu Ile
20 25 30

Pro Val Phe Leu Ile Leu Phe Ile Ala Leu Val Gly Leu Val Gly Asn
35 40 45

Gly Phe Val Leu Trp Leu Leu Gly Phe Arg Met Arg Arg Asn Ala Phe
50 55 60

Ser Val Tyr Val Leu Ser Leu Ala Gly Ala Asp Phe Leu Phe Leu Cys
65 70 75 80

Phe Gln Ile Ile Asn Cys Leu Val Tyr Leu Ser Asn Phe Phe Cys Ser
85 90 95

Ile Ser Ile Asn Phe Pro Ser Phe Phe Thr Thr Val Met Thr Cys Ala
100 105 110

Tyr Leu Ala Gly Leu Ser Met Leu Ser Thr Val Ser Thr Glu Arg Cys
115 120 125

Leu Ser Val Leu Trp Pro Ile Trp Tyr Arg Cys Arg Arg Pro Arg His
130 135 140

Leu Ser Ala Val Val Cys Val Leu Leu Trp Ala Leu Ser Leu Leu Leu
145 150 155 160

Ser Ile Leu Glu Gly Lys Phe Cys Gly Phe Leu Phe Ser Asp Gly Asp
165 170 175

Ser Gly Trp Cys Gln Thr Phe Asp Phe Ile Thr Ala Ala Trp Leu Ile
180 185 190

Phe Leu Phe Met Val Leu Cys Gly Ser Ser Leu Ala Leu Leu Val Arg
195 200 205

Ile Leu Cys Gly Ser Arg Gly Leu Pro Leu Thr Arg Leu Tyr Leu Thr
210 215 220

Ile Leu Leu Thr Val Leu Val Phe Leu Leu Cys Gly Leu Pro Phe Gly
225 230 235 240

Ile Gln Trp Phe Leu Ile Leu Trp Ile Trp Lys Asp Ser Asp Val Leu
245 250 255

Phe Cys His Ile His Pro Val Ser Val Val Leu Ser Ser Leu Asn Ser
260 265 270

Ser Ala Asn Pro Ile Ile Tyr Phe Phe Val Gly Ser Phe Arg Lys Gln
275 280 285

Trp Arg Leu Gln Gln Pro Ile Leu Lys Leu Ala Leu Gln Arg Ala Leu
290 295 300

Gln Asp Ile Ala Glu Val Asp His Ser Glu Gly Cys Phe Arg Gln Gly
305 310 315 320

Thr Pro Glu Met Ser Arg Ser Ser Leu Val
325 330 
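
As a consistency check on the listing, the coding sequence of SEQ ID NO: 29 can be translated and compared with the protein of SEQ ID NO: 30. The following is only a minimal sketch, assuming the standard genetic code and a reading frame that starts at the first base; the names `CODON_TABLE` and `translate` are illustrative and do not appear in the original document. Codons containing characters other than a, c, g, t (for example OCR damage) are reported as `X` rather than guessed.

```python
from itertools import product

# Standard genetic code, written in the conventional T, C, A, G ordering.
# "*" marks a stop codon.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}


def translate(dna: str) -> str:
    """Translate a DNA string (whitespace ignored) into one-letter amino acids.

    Translation starts at the first base and stops at the first stop codon.
    Unreadable codons are returned as 'X'.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of SEQ ID NO: 29 as printed in the listing.
print(translate("atggatccaa ccaccccggc ctggggaaca"))  # MDPTTPAWGT
# Length check: 993 bases = 3 x (330 residues + 1 stop codon).
print(993 == 3 * (330 + 1))  # True
```

The one-letter result `MDPTTPAWGT` corresponds to the first ten residues of SEQ ID NO: 30 (Met Asp Pro Thr Thr Pro Ala Trp Gly Thr). The same check can be applied to any DNA/protein pair in the listing to locate residues that were damaged during extraction.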

<210> 31 

<211> 1092 

<212> DNA 

<213> Homo sapiens 

<400> 31 







gc uggcgggu 


(-H-Cugyuya 


tggtactggc 


cgtggcgctg 


60 






ycLT_uyLT_gc 


gcctacagcg 


ct gagctccg 


cactcgagcc 


120 




+~ /~T f-r +~ "> 


x.CL.gx.cgcL.g 


ggccacctgc 


tgctggcggc 


gctggaca tg 


180 


^ 1. ^ w CL u ^ 


1"rrr't"r'f*TfT"t"ri't" 
^yv^ u-v^vjy ^y ^ 


ga tgcgcggg 


(_yyciCaCCyc 


cggcgcccgg 


cgcatgccaa 


2 4 0 






t^u L^u.i.yyt.y 


uccaacgcgg 


cgc tgagcgt 


ggcggcgctg 


300 






ay Lyyyd.L.c 


ccacbgcgct 


acgccggacg 


cctgcgaccg 


360 




y^^cyt— uyv.'L. 


yyyv-v-yL-ycc 


tggggacagt 


cgctggcctt 


ctcaggcgct 


420 




yt. L(— y L.yyi,,L 


uyyo^uacagc 


agcgccttcg 


cgtcctgttc 


gctgcgcctg 


4 80 


ccgcccgagc ctgagcgtcc gcgcttcgca gccttcaccg ccacgctcca tgccgtgggc 540

ttcgtgctgc cgctggcggt gctctgcctc acctcgctcc aggtgcaccg ggtggcacgc 600

agccactgcc agcgcatgga caccgtcacc atgaaggcgc tcgcgctgct cgccgacctg 660

caccccagtg tgcggcagcg ctgcctcatc cagcagaagc ggcgccgcca ccgcgccacc 720

aggaagattg gcattgctat tgcgaccttc ctcatctgct ttgccccgta tgtcatgacc 780

aggctggcgg agctcgtgcc cttcgtcacc gtgaacgccc agtggggcat cctcagcaag 840

tgcctgacct acagcaaggc ggtggccgac ccgttcacgt actctctgct ccgccggccg 900

ttccgccaag tcctggccgg catggtgcac cggctgctga agagaacccc gcgcccagca 960

tccacccatg acagctctct ggatgtggcc ggcatggtgc accagctgct gaagagaacc 1020

ccgcgcccag cgtccaccca caacggctct gtggacacag agaatgattc ctgcctgcag 1080

cagacacact ga 1092



<210> 32 

<211> 363 

<212> PRT 

<213> Homo sapiens 

<400> 32 

Met Gly Pro Gly Glu Ala Leu Leu Ala Gly Leu Leu Val Met Val Leu
1 5 10 15

Ala Val Ala Leu Leu Ser Asn Ala Leu Val Leu Leu Cys Cys Ala Tyr
20 25 30

Ser Ala Glu Leu Arg Thr Arg Ala Ser Gly Val Leu Leu Val Asn Leu
35 40 45

Ser Leu Gly His Leu Leu Leu Ala Ala Leu Asp Met Pro Phe Thr Leu
50 55 60

Leu Gly Val Met Arg Gly Arg Thr Pro Ser Ala Pro Gly Ala Cys Gln
65 70 75 80

Val Ile Gly Phe Leu Asp Thr Phe Leu Ala Ser Asn Ala Ala Leu Ser
85 90 95

Val Ala Ala Leu Ser Ala Asp Gln Trp Leu Ala Val Gly Phe Pro Leu
100 105 110

Arg Tyr Ala Gly Arg Leu Arg Pro Arg Tyr Ala Gly Leu Leu Leu Gly
115 120 125

Cys Ala Trp Gly Gln Ser Leu Ala Phe Ser Gly Ala Ala Leu Gly Cys
130 135 140

Ser Trp Leu Gly Tyr Ser Ser Ala Phe Ala Ser Cys Ser Leu Arg Leu
145 150 155 160

Pro Pro Glu Pro Glu Arg Pro Arg Phe Ala Ala Phe Thr Ala Thr Leu
165 170 175

His Ala Val Gly Phe Val Leu Pro Leu Ala Val Leu Cys Leu Thr Ser
180 185 190

Leu Gln Val His Arg Val Ala Arg Ser His Cys Gln Arg Met Asp Thr
195 200 205

Val Thr Met Lys Ala Leu Ala Leu Leu Ala Asp Leu His Pro Ser Val
210 215 220

Arg Gln Arg Cys Leu Ile Gln Gln Lys Arg Arg Arg His Arg Ala Thr
225 230 235 240

Arg Lys Ile Gly Ile Ala Ile Ala Thr Phe Leu Ile Cys Phe Ala Pro
245 250 255

Tyr Val Met Thr Arg Leu Ala Glu Leu Val Pro Phe Val Thr Val Asn
260 265 270

Ala Gln Trp Gly Ile Leu Ser Lys Cys Leu Thr Tyr Ser Lys Ala Val
275 280 285

Ala Asp Pro Phe Thr Tyr Ser Leu Leu Arg Arg Pro Phe Arg Gln Val
290 295 300

Leu Ala Gly Met Val His Arg Leu Leu Lys Arg Thr Pro Arg Pro Ala
305 310 315 320

Ser Thr His Asp Ser Ser Leu Asp Val Ala Gly Met Val His Gln Leu
325 330 335

Leu Lys Arg Thr Pro Arg Pro Ala Ser Thr His Asn Gly Ser Val Asp
340 345 350

Thr Glu Asn Asp Ser Cys Leu Gln Gln Thr His
355 360

<210> 33 
<211> 1125 
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<212> DNA 

<213> Homo sapiens 

<400> 33 



atgcccacac tcaatacttc tgcctctcca cccacattct tctgggccaa tgcctccgga 60

ggcagtgtgc tgagtgctga tgatgctccg atgcctgtca aattcctagc cctgaggctc 120

atggttgccc tggcctatgg gcttgtgggg gccattggct tgctgggaaa tttggcggtg 180

ctgtgggtac tgagtaactg tgcccggaga gcccctggcc caccttcaga caccttcgtc 240

ttcaacctgg ctctggcgga cctgggactg gcactcactc tccccttttg ggcagccgag 300

tcggcactgg actttcactg gcccttcgga ggtgccctct gcaagatggt tctgacggcc 360

actgtcctca acgtctatgc cagcatcttc ctcatcacag cgctgagcgt tgctcgctac 420

tgggtggtgg ccatggctgc ggggccaggc acccacctct cactcttctg ggcccgaata 480

gccaccctgg cagtgtggcc ggcggctgcc ctggtgacgg tgcccacagc tgtcttcggg 540

gtggagggtg c»g7r 7tgt ag tgtgcgcctt tgcctgctgc gtttccccag caggtactgg 600

ctgggggcct accagctgca gagggtggtg ctggctttca tggtgccctt gggcgtcatc 660

accaccagct arjct 7Ct qt-t ncz qgccttc ctgcagcggc ggcaacggcg gcggcaggac 720

agcagggtcg t ry.JCCCQCt C tgtccgcatc ctggtggctt ccttcttcct ctgctggttt 780

cccaaccatg togr c.d;:tct ctggggtgtc ctggtgaagt ttgacctggt gccctggaac 840

agtactttct [illegible] cacgtatgtc ttccctgtca ctacttgctt ggcacacagc 900

aatagctgcc tCtitiwCCtqt gctgtactgt ctcctgaggc gggagccccg gcaggctctg 960

gcaggcacct Tica^qaat c t gcggtcgagg ctgtggcccc agggcggagg ctgggtgcaa 1020

caggtggccc t aariqc<iqqt. aggcaggcgg tgggtcgcaa gcaacccccg ggagagccgc 1080

ccttctaccc tgctcaccaa cctggacaga gggacacccg ggtga 1125



<210> 34 

<211> 374 

<212> PRT 

<213> Homo sapiens 

<400> 34 

Met Pro Thr Leu Asn Thr Ser Ala Ser Pro Pro Thr Phe Phe Trp Ala
1 5 10 15

Asn Ala Ser Gly Gly Ser Val Leu Ser Ala Asp Asp Ala Pro Met Pro
20 25 30

Val Lys Phe Leu Ala Leu Arg Leu Met Val Ala Leu Ala Tyr Gly Leu
35 40 45

Val Gly Ala Ile Gly Leu Leu Gly Asn Leu Ala Val Leu Trp Val Leu
50 55 60

Ser Asn Cys Ala Arg Arg Ala Pro Gly Pro Pro Ser Asp Thr Phe Val
65 70 75 80

Phe Asn Leu Ala Leu Ala Asp Leu Gly Leu Ala Leu Thr Leu Pro Phe
85 90 95

Trp Ala Ala Glu Ser Ala Leu Asp Phe His Trp Pro Phe Gly Gly Ala
100 105 110

Leu Cys Lys Met Val Leu Thr Ala Thr Val Leu Asn Val Tyr Ala Ser
115 120 125

Ile Phe Leu Ile Thr Ala Leu Ser Val Ala Arg Tyr Trp Val Val Ala
130 135 140

Met Ala Ala Gly Pro Gly Thr His Leu Ser Leu Phe Trp Ala Arg Ile
145 150 155 160

Ala Thr Leu Ala Val Trp Ala Ala Ala Ala Leu Val Thr Val Pro Thr
165 170 175

Ala Val Phe Gly Val Glu Gly Glu Val Cys Gly Val Arg Leu Cys Leu
180 185 190

Leu Arg Phe Pro Ser Arg Tyr Trp Leu Gly Ala Tyr Gln Leu Gln Arg
195 200 205

Val Val Leu Ala Phe Met Val Pro Leu Gly Val Ile Thr Thr Ser Tyr
210 215 220

Leu Leu Leu Leu Ala Phe Leu Gln Arg Arg Gln Arg Arg Arg Gln Asp
225 230 235 240

Ser Arg Val Val Ala Arg Ser Val Arg Ile Leu Val Ala Ser Phe Phe
245 250 255

Leu Cys Trp Phe Pro Asn His Val Val Thr Leu Trp Gly Val Leu Val
260 265 270

Lys Phe Asp Leu Val Pro Trp Asn Ser Thr Phe Tyr Thr Ile Gln Thr
275 280 285

Tyr Val Phe Pro Val Thr Thr Cys Leu Ala His Ser Asn Ser Cys Leu
290 295 300

Asn Pro Val Leu Tyr Cys Leu Leu Arg Arg Glu Pro Arg Gln Ala Leu
305 310 315 320

Ala Gly Thr Phe Arg Asp Leu Arg Ser Arg Leu Trp Pro Gln Gly Gly
325 330 335

Gly Trp Val Gln Gln Val Ala Leu Lys Gln Val Gly Arg Arg Trp Val
340 345 350

Ala Ser Asn Pro Arg Glu Ser Arg Pro Ser Thr Leu Leu Thr Asn Leu
355 360 365

Asp Arg Gly Thr Pro Gly
370

<210> 35 

<211> 1092 

<212> DNA 

<213> Homo sapiens 

<400> 35 

atgaatcggc accatctgca ggatcacttt ctggaaatag acaagaagaa ctgctgtgtg 60

ttccgagatg acttcattgt caaggtgttg ccgccggtgt tggggctgga gtttatcttc 120

gggcttctgg gcaatggcct tgccctgtgg attttctgtt tccacctcaa gtcctggaaa 180

tccagccgga ttttcctgtt caacctggca gtggctgact ttctactgat catctgcctg 240

cccttcctga tggacaacta tgtgaggcgt tgggactgga agtttgggga catcccttgc 300

cggctgatgc tcttcatgtt ggctatgaac cgccagggca gcatcatctt cctcacggtg 360

gtggcggtag acaggtattt ccgggtggtc catccccacc acgccctgaa caagatctcc 420

aatcggacag cagccatcat ctcttgcctt ctgtggggca tcactattgg cctgacagtc 480

cacctcctga agaagaagat gccgatccag aatggcggtg caaatttgtg cagcagcttc 540

agcatctgcc ataccttcca gtggcacgaa gccatgttcc tcctggagtt cttcctgccc 600

ctgggcatca tcctgttctg ctcagccaga attatctgga gcctgcggca gagacaaatg 660

gaccggcatg ccaagatcaa gagagccatc accttcatca tggtggtggc catcgtcttt 720

gtcatctgct tccttcccag cgtggttgtg cggatccgca tcttctggct cctgcacact 780

tcgggcacgc agaattgtga agtgtaccgc tcggtggacc tggcgttctt tatcactctc 840

agcttcacct acatgaacag catgctggac cccgtggtgt actacttctc cagcccatcc 900

tttcccaact tcttctccac tttgatcaac cgctgcctcc agaggaagat gacaggtgag 960

ccagataata accgcagcac gagcgtcgag ctcacagggg accccaacaa aaccagaggc 1020

gctccagagg cgttaatggc caactccggt gagccatgga gcccctctta tctgggccca 1080

acctctcctt aa 1092



<210> 36 

<211> 363 

<212> PRT 

<213> Homo sapiens 

<400> 36 

Met Asn Arg His His Leu Gln Asp His Phe Leu Glu Ile Asp Lys Lys
1 5 10 15

Asn Cys Cys Val Phe Arg Asp Asp Phe Ile Val Lys Val Leu Pro Pro
20 25 30

Val Leu Gly Leu Glu Phe Ile Phe Gly Leu Leu Gly Asn Gly Leu Ala
35 40 45

Leu Trp Ile Phe Cys Phe His Leu Lys Ser Trp Lys Ser Ser Arg Ile
50 55 60

Phe Leu Phe Asn Leu Ala Val Ala Asp Phe Leu Leu Ile Ile Cys Leu
65 70 75 80

Pro Phe Leu Met Asp Asn Tyr Val Arg Arg Trp Asp Trp Lys Phe Gly
85 90 95

Asp Ile Pro Cys Arg Leu Met Leu Phe Met Leu Ala Met Asn Arg Gln
100 105 110

Gly Ser Ile Ile Phe Leu Thr Val Val Ala Val Asp Arg Tyr Phe Arg
115 120 125

Val Val His Pro His His Ala Leu Asn Lys Ile Ser Asn Arg Thr Ala
130 135 140

Ala Ile Ile Ser Cys Leu Leu Trp Gly Ile Thr Ile Gly Leu Thr Val
145 150 155 160

His Leu Leu Lys Lys Lys Met Pro Ile Gln Asn Gly Gly Ala Asn Leu
165 170 175

Cys Ser Ser Phe Ser Ile Cys His Thr Phe Gln Trp His Glu Ala Met
180 185 190

Phe Leu Leu Glu Phe Phe Leu Pro Leu Gly Ile Ile Leu Phe Cys Ser
195 200 205

Ala Arg Ile Ile Trp Ser Leu Arg Gln Arg Gln Met Asp Arg His Ala
210 215 220

Lys Ile Lys Arg Ala Ile Thr Phe Ile Met Val Val Ala Ile Val Phe
225 230 235 240

Val Ile Cys Phe Leu Pro Ser Val Val Val Arg Ile Arg Ile Phe Trp
245 250 255

Leu Leu His Thr Ser Gly Thr Gln Asn Cys Glu Val Tyr Arg Ser Val
260 265 270

Asp Leu Ala Phe Phe Ile Thr Leu Ser Phe Thr Tyr Met Asn Ser Met
275 280 285

Leu Asp Pro Val Val Tyr Tyr Phe Ser Ser Pro Ser Phe Pro Asn Phe
290 295 300

Phe Ser Thr Leu Ile Asn Arg Cys Leu Gln Arg Lys Met Thr Gly Glu
305 310 315 320

Pro Asp Asn Asn Arg Ser Thr Ser Val Glu Leu Thr Gly Asp Pro Asn
325 330 335

Lys Thr Arg Gly Ala Pro Glu Ala Leu Met Ala Asn Ser Gly Glu Pro
340 345 350

Trp Ser Pro Ser Tyr Leu Gly Pro Thr Ser Pro
355 360

<210> 37 

<211> 1044 

<212> DNA 

<213> Homo sapiens 

<400> 37 

atgggggatg agctggcacc ttgccctgtg ggcactacag cttggccggc cctgatccag 60 

ctcatcagca agacaccctg catgccccaa gcagccagca acacttcctt gggcctgggg 120 

gacctcaggg tgcccagctc catgctgtac tggcttttcc ttccctcaag cctgctggct 180 

gcagccacac tggctgtcag ccccctgctg ctggtgacca tcctgcggaa ccaacggctg 240 

cgacaggagc cccactacct gctcccggct aacatcctgc tctcagacct ggcctacatt 300

ctcctccaca tgctcatctc ctccagcagc ctgggtggct gggagctggg ccgcatggcc 360

tgtggcattc tcactgatgc tgtcttcgcc gcctgcacca gcaccatcct gtccttcacc 420

gccattgtgc tgcacaccta cctggcagtc atccatccac tgcgctacct ctccttcatg 480

tcccatgggg ctgcctggaa ggcagtggcc ctcatctggc tggtggcctg ctgcttcccc 540

acattcctta tttggctcag caagtggcag gatgcccagc tggaggagca aggagcttca 600

tacatcctac caccaagcat gggcacccag ccgggatgtg gcctcctggt cattgttacc 660

tacacctcca ttctgtgcgt tctgttcctc tgcacagctc tcattgccaa ctgtttctgg 720

aggatctatg cagaggccaa gacttcaggc atctgggggc agggctattc ccgggccagg 780

ggcaccctgc tgatccactc agtgctgatc acattgtacg tgagcacagg ggtggtgttc 840

tccctggaca tggtgctgac caggtaccac cacattgact ctgggactca cacatggctc 900

ctggcagcta acagtgaggt actcatgatg cttccccgtg ccatgctccc atacctgtac 960

ctgctccgct accggcagct gttgggcatg gtccggggcc acctcccatc caggaggcac 1020

caggccatct ttaccatttc ctag 1044



<210> 38 

<211> 347 

<212> PRT

<213> Homo sapiens 

<400> 38 

Met Gly Asp Glu Leu Ala Pro Cys Pro Val Gly Thr Thr Ala Trp Pro
1 5 10 15

Ala Leu Ile Gln Leu Ile Ser Lys Thr Pro Cys Met Pro Gln Ala Ala
20 25 30

Ser Asn Thr Ser Leu Gly Leu Gly Asp Leu Arg Val Pro Ser Ser Met
35 40 45

Leu Tyr Trp Leu Phe Leu Pro Ser Ser Leu Leu Ala Ala Ala Thr Leu
50 55 60

Ala Val Ser Pro Leu Leu Leu Val Thr Ile Leu Arg Asn Gln Arg Leu
65 70 75 80

Arg Gln Glu Pro His Tyr Leu Leu Pro Ala Asn Ile Leu Leu Ser Asp
85 90 95

Leu Ala Tyr Ile Leu Leu His Met Leu Ile Ser Ser Ser Ser Leu Gly
100 105 110

Gly Trp Glu Leu Gly Arg Met Ala Cys Gly Ile Leu Thr Asp Ala Val
115 120 125

Phe Ala Ala Cys Thr Ser Thr Ile Leu Ser Phe Thr Ala Ile Val Leu
130 135 140

His Thr Tyr Leu Ala Val Ile His Pro Leu Arg Tyr Leu Ser Phe Met
145 150 155 160

Ser His Gly Ala Ala Trp Lys Ala Val Ala Leu Ile Trp Leu Val Ala
165 170 175

Cys Cys Phe Pro Thr Phe Leu Ile Trp Leu Ser Lys Trp Gln Asp Ala
180 185 190

Gln Leu Glu Glu Gln Gly Ala Ser Tyr Ile Leu Pro Pro Ser Met Gly
195 200 205

Thr Gln Pro Gly Cys Gly Leu Leu Val Ile Val Thr Tyr Thr Ser Ile
210 215 220

Leu Cys Val Leu Phe Leu Cys Thr Ala Leu Ile Ala Asn Cys Phe Trp
225 230 235 240

Arg Ile Tyr Ala Glu Ala Lys Thr Ser Gly Ile Trp Gly Gln Gly Tyr
245 250 255

Ser Arg Ala Arg Gly Thr Leu Leu Ile His Ser Val Leu Ile Thr Leu
260 265 270

Tyr Val Ser Thr Gly Val Val Phe Ser Leu Asp Met Val Leu Thr Arg
275 280 285

Tyr His His Ile Asp Ser Gly Thr His Thr Trp Leu Leu Ala Ala Asn



Ser Arg Arg His Gln Ala Ile Phe Thr Ile Ser
340 345

<210> 39 

<211> 1023 

<212> DNA 

<213> Homo sapiens 

<400> 39 

atgaatccat ttcatgcatc ttgttggaac acctctgccg aacttttaaa caaatcctgg 60 

aataaagagt ttgcttatca aactgccagt gtggtagata cagtcatcct cccttccatg 120 

attgggatta tctgttcaac agggctggtt ggcaacatcc tcattgtatt cactataata 180 

agatccagga aaaaaacagt ccctgacatc tatatctgca acctggctgt ggctgatttg 240 

gtccacatag ttggaatgcc ttttcttatt caccaatggg cccgaggggg agagtgggtg 300

tttggggggc ctctctgcac catcatcaca tccctggata cttgtaacca atttgcctgt 360 

agtgccatca tgactgtaat gagtgtggac aggtactttg ccctcgtcca accatttcga 420 

ctgacacgtt ggagaacaag gtacaagacc atccggatca atttgggcct ttgggcagct 480 

tcctttatcc tggcattgcc tgtctgggtc tactcgaagg tcatcaaatt taaagacggt 540 

gttgagagtt gtgcttttga tttgacatcc cctgacgatg tactctggta tacactttat 600 

ttgacgataa caactttttt tttccctcta cccttgattt tggtgtgcta tattttaatt 660 

ttatgctata cttgggagat gtatcaacag aataaggatg ccagatgctg caatcccagt 720 

gtaccaaaac agagagtgat gaagttgaca aagatggtgc tggtgctggt ggtagtcttt 780 
atcctgagtg ctgcccctta tcatgtgata caactggtga acttacagat ggaacagccc 840

acactggcct tctatgtggg ttattacctc tccatctgtc tcagctatgc cagcagcagc 900 

attaaccctt ttctctacat cctgctgagt ggaaatttcc agaaacgtct gcctcaaatc 960 

caaagaagag cgactgagaa ggaaatcaac aatatgggaa acactctgaa atcacacttt 1020 

tag 1023 

<210> 40 

<211> 340 

<212> PRT 

<213> Homo sapiens 

<400> 40 

Met Asn Pro Phe His Ala Ser Cys Trp Asn Thr Ser Ala Glu Leu Leu
1 5 10 15

Asn Lys Ser Trp Asn Lys Glu Phe Ala Tyr Gln Thr Ala Ser Val Val
20 25 30

Asp Thr Val Ile Leu Pro Ser Met Ile Gly Ile Ile Cys Ser Thr Gly
35 40 45

Leu Val Gly Asn Ile Leu Ile Val Phe Thr Ile Ile Arg Ser Arg Lys
50 55 60

Lys Thr Val Pro Asp Ile Tyr Ile Cys Asn Leu Ala Val Ala Asp Leu
65 70 75 80

Val His Ile Val Gly Met Pro Phe Leu Ile His Gln Trp Ala Arg Gly
85 90 95

Gly Glu Trp Val Phe Gly Gly Pro Leu Cys Thr Ile Ile Thr Ser Leu
100 105 110

Asp Thr Cys Asn Gln Phe Ala Cys Ser Ala Ile Met Thr Val Met Ser
115 120 125

Val Asp Arg Tyr Phe Ala Leu Val Gln Pro Phe Arg Leu Thr Arg Trp
130 135 140

Arg Thr Arg Tyr Lys Thr Ile Arg Ile Asn Leu Gly Leu Trp Ala Ala
145 150 155 160

Ser Phe Ile Leu Ala Leu Pro Val Trp Val Tyr Ser Lys Val Ile Lys
165 170 175

Phe Lys Asp Gly Val Glu Ser Cys Ala Phe Asp Leu Thr Ser Pro Asp
180 185 190

Asp Val Leu Trp Tyr Thr Leu Tyr Leu Thr Ile Thr Thr Phe Phe Phe
195 200 205

Pro Leu Pro Leu Ile Leu Val Cys Tyr Ile Leu Ile Leu Cys Tyr Thr
210 215 220

Trp Glu Met Tyr Gln Gln Asn Lys Asp Ala Arg Cys Cys Asn Pro Ser
225 230 235 240

Val Pro Lys Gln Arg Val Met Lys Leu Thr Lys Met Val Leu Val Leu
245 250 255

Val Val Val Phe Ile Leu Ser Ala Ala Pro Tyr His Val Ile Gln Leu
260 265 270

Val Asn Leu Gln Met Glu Gln Pro Thr Leu Ala Phe Tyr Val Gly Tyr
275 280 285

Tyr Leu Ser Ile Cys Leu Ser Tyr Ala Ser Ser Ser Ile Asn Pro Phe
290 295 300

Leu Tyr Ile Leu Leu Ser Gly Asn Phe Gln Lys Arg Leu Pro Gln Ile
305 310 315 320

Gln Arg Arg Ala Thr Glu Lys Glu Ile Asn Asn Met Gly Asn Thr Leu
325 330 335

Lys Ser His Phe
340



<210> 41
<211> 24
<212> DNA
<213> Artificial Sequence
<220>
<221> misc_feature
<223> Novel Sequence

<400> 41

cttgcagaca tcaccatggc agcc 24

<210> 42
<211> 24
<212> DNA
<213> Artificial Sequence
<220>
<221> misc_feature
<223> Novel Sequence

<400> 42

gtgatgctct gagtactgga ctgg 24

<210> 43 

<211> 20 

<212> DNA 

<213> Artificial Sequence 
<220> 

<221> misc_feature

<223> Novel Sequence 

<400> 43 

gaagctgtga agagtgatgc 20 

<210> 44
<211> 24
<212> DNA
<213> Artificial Sequence
<220>
<221> misc_feature
<223> Novel Sequence

<400> 44

gtcagcaata ttgataagca gcag 24

<210> 45
<211> 27
<212> DNA
<213> Artificial Sequence
<220>
<221> misc_feature
<223> Novel Sequence

<400> 45

ccatggggaa cgattctgtc agctacg 27



<210> 46
<211> 24
<212> DNA
<213> Artificial Sequence
<220>
<221> misc_feature
<223> Novel Sequence

<400> 46

gctatgcctg aagccagtct tgtg 24

<210> 47
<211> 26
<212> DNA
<213> Artificial Sequence
<220>
<221> misc_feature
<223> Novel Sequence

<400> 47

ccaggatgtt gtgtcaccgt ggtggc 26



<210> 48
<211> 26
<212> DNA
<213> Artificial Sequence
<220>
<221> misc_feature
<223> Novel Sequence

<400> 48

cacagcgctg cagccctgca gctggc 26

<210> 49

<211> 26 

<212> DNA 

<213> Artificial Sequence 
<220> 

<221> misc_feature 

<223> Novel Sequence 

<400> 49

cttcctctcg tagggatgaa ccagac 26

<210> 50 

<211> 26 

<212> DNA 

<213> Artificial Sequence 
<220> 

<221> misc_feature 
<223> Novel Sequence 

<400> 50

ctcgcacagg tgggaagcac ctgtgg 26



<210> 51
<211> 23
<212> DNA
<213> Artificial Sequence
<220>
<221> misc_feature
<223> Novel Sequence

<400> 51

gcctgtgaca ggaggtaccc tgg 23

<210> 52
<211> 25
<212> DNA
<213> Artificial Sequence
<220>
<221> misc_feature
<223> Novel Sequence

<400> 52

catatccctc cgagtgtcca gcggc 25

<210> 53
<211> 31
<212> DNA
<213> Artificial Sequence
<220>
<221> misc_feature
<223> Novel Sequence

<400> 53

gcatggagag aaaatttatg tccttgcaac 31

<210> 54
<211> 27
<212> DNA
<213> Artificial Sequence
<220>
<221> misc_feature
<223> Novel Sequence

<400> 54

caagaacagg tctcatctaa gagctcc 27

<210> 55
<211> 26
<212> DNA
<213> Artificial Sequence
<220>
<221> misc_feature
<223> Novel Sequence

<400> 55

gctgttgcca tgacgtccac ctgcac 26



<210> 56
<211> 26
<212> DNA
<213> Artificial Sequence
<220>
<221> misc_feature
<223> Novel Sequence

<400> 56

ggacagttca aggtttgcct tagaac 26



<210> 57
<211> 23
<212> DNA
<213> Artificial Sequence
<220>
<221> misc_feature
<223> Novel Sequence

<400> 57

ctttcgatac tgctcctatg ctc 23

<210> 58
<211> 26
<212> DNA
<213> Artificial Sequence
<220>
<221> misc_feature
<223> Novel Sequence

<400> 58

gtagtccact gaaagtccag tgatcc 26

<210> 59
<211> 26
<212> DNA
<213> Artificial Sequence
<220>
<221> misc_feature
<223> Novel Sequence

<400> 59

tttctgagca tyqitccaac cdtctc 26



<210> 60
<211> 25
<212> DNA
<213> Artificial Sequence
<220>
<221> misc_feature
<223> Novel Sequence

<400> 60

ctgtctgaca gggcj^a^qc tcttc 25

<210> 61 

<211> 28 

<212> DNA 

<213> Artificial Sequence 
<220> 

<221> misc_feature

<223> Novel Sequence

<400> 61 

ggaactcgta tagacccagc gtcgctcc 28 



<210> 62
<211> 28
<212> DNA
<213> Artificial Sequence
<220>
<221> misc_feature
<223> Novel Sequence

<400> 62

ggaggttgcg ccttagcgac agatgacc 28



<210> 63
<211> 22
<212> DNA
<213> Artificial Sequence
<220>
<221> misc_feature
<223> Novel Sequence

<400> 63

ctgcacccgg acacttgctc tg 22

<210> 64
<211> 25
<212> DNA
<213> Artificial Sequence
<220>
<221> misc_feature
<223> Novel Sequence

<400> 64

gtctgcttgt tcagtgccac tcaac 25

<210> 65 

<211> 26 

<212> DNA 

<213> Artificial Sequence 
<220> 

<221> misc_feature 

<223> Novel Sequence 

<400> 65 

tatctgcaat tctattctag ctcctg 26 



<210> 66
<211> 26
<212> DNA
<213> Artificial Sequence
<220>
<221> misc_feature
<223> Novel Sequence

<400> 66

tgtccctaat aaagtcacat gaatgc 26

<210> 67
<211> 23
<212> DNA
<213> Artificial Sequence
<220>
<221> misc_feature
<223> Novel Sequence

<400> 67

ggagacaacc atgaatgagc cac 23



<210> 68 

<211> 24 

<212> DNA 

<213> Artificial Sequence 
<220> 

<221> misc_feature

<223> Novel Sequence 



<400> 68 

tatttcaagg gttgtttgag taac 24 



<210> 69
<211> 27
<212> DNA
<213> Artificial Sequence
<220>
<221> misc_feature
<223> Novel Sequence

<400> 69

ggcaccagtg gaggtcttct gagcatg 27

<210> 70
<211> 27
<212> DNA
<213> Artificial Sequence
<220>
<221> misc_feature
<223> Novel Sequence

<400> 70

ctgatggaag tagaggctgt ccatctc 27

<210> 71
<211> 23
<212> DNA
<213> Artificial Sequence
<220>
<221> misc_feature
<223> Novel Sequence

<400> 71

cctggcgagc cgctagcgcc atg 23



<210> 72
<211> 23
<212> DNA
<213> Artificial Sequence
<220>
<221> misc_feature
<223> Novel Sequence

<400> 72

atgagccctg ccaggccctc agt 23

<210> 73
<211> 27
<212> DNA
<213> Artificial Sequence
<220>
<221> misc_feature
<223> Novel Sequence

<400> 73

ctgcgatgcc cacactcaat acttctg 27



<210> 74
<211> 27
<212> DNA
<213> Artificial Sequence
<220>
<221> misc_feature
<223> Novel Sequence

<400> 74

aaggatccta cacttggtgg atctcag 27

<210> 75
<211> 22
<212> DNA
<213> Artificial Sequence

<400> 75

gctggagcat tcactaggcg ag 22

<210> 76 

<211> 24 

<212> DNA 

<213> Artificial Sequence 
<220> 

<221> misc_feature

<223> Novel Sequence 

<400> 76 

agatcctggt tcttggtgac aatg 24 
<210> 77 

<211> 24
<212> DNA
<213> Artificial Sequence
<220>
<221> misc_feature
<223> Novel Sequence

<400> 77

agccatccct gccaggaagc atgg 24

<210> 78
<211> 27
<212> DNA
<213> Artificial Sequence
<220>
<221> misc_feature
<223> Novel Sequence

<400> 78

ccagactgtg gactcaagaa ctctagg 27



<210> 79
<211> 28
<212> DNA
<213> Artificial Sequence
<220>
<221> misc_feature
<223> Novel Sequence

<400> 79

agtccacgaa caatgaatcc atttcatg 28



<210> 80
<211> 25
<212> DNA
<213> Artificial Sequence
<220>
<221> misc_feature
<223> Novel Sequence

<400> 80

atcatgtcta gactcatggt gatcc 25

<210> 81
<211> 30
<212> DNA
<213> Artificial Sequence
<220>
<221> misc_feature
<223> Novel Sequence

<400> 81

ggggagggaa agcaaaggtg gtcctcctgg 30

<210> 82 

<211> 30 

<212> DNA 

<213> Artificial Sequence 
<220> 

<221> misc_feature

<223> Novel Sequence 

<400> 82 

ccaggagaac cacctttgct ttccctcccc 30 

<210> 83 

<211> 1356 

<212> DNA 

<213> Homo sapiens 

<400> 83 

atggagtcct cacccatccc ccagtcatca gggaactctt ccactttggg gagggtccct 60 

caaaccccag gtccctctac tgccagtggg gtcccggagg tggggctacg ggatgttgct 120

tcggaatctg tggccctctt cttcatgctc ctgctggact tgactgctgt ggctggcaat 180

gccgctgtga tggccgtgat cgccaagacg cctgccctcc gaaaatttgt cttcgtcttc 240

cacctctgcc t.r:qt:ggacct gctggctgcc ctgaccctca tgcccctggc catgctctcc 300

agctctgccc tctttgacca cgccctcttt ggggaggtgg cctgccgcct ctacttgttt 360

ctgagcgtgt gctttgtcag cctggccatc ctctcggtgt cagccatcaa tgtggagcgc 420

tactattacg tagtccaccc catgcgctac gaggtgcgca tgacgctggg gctggtggcc 480

tctgtgctgg tgggtgtgtg ggtgaaggcc ttggccatgg cttctgtgcc agtgttggga 540 

agggtctcct ggcaggaagg agctcccagt gtccccccag gctgttcact ccagtggagc 600 

cacagtgcct actgccagct ttttgtggtg gtctttgctg tcctttactt tctgttgccc 660 

ctgctcctca tacttgtggt ctactgcagc atgttccgag tggcccgcgt ggctgccatg 720 

cagcacgggc cgctgcccac gtggatggag acaccccggc aacgctccga atctctcagc 780 

agccgctcca cgatggtcac cagctcgggg gccccccaga ccaccccaca ccggacgttt 840 

gggggaggga aagcaaaggt ggttctcctg gctgtggggg gacagttcct gctctgttgg 900 

ttgccctact tccctttcca cctctatgtt gccctgagtg ctcagcccat ttcaactggg 960 

caggtggaga gtgtggtcac ctggattggc tacttttgct tcacttccaa ccctttcttc 1020 

tatggatgtc tcaaccggca gatccggggg gagctcagca agcagtttgt ctgcttcttc 1080 

aagccagctc cagaggagga gctgaggctg cctagccggg agggctccat tgaggagaac 1140 

ttcctgcagt tccttcaggg gactggctgt ccttctgagt cctgggtttc ccgaccccta 1200

cccagcccca agcaggagcc acctgctgtt gactttcgaa tcccaggcca gatagctgag 1260 
gagacctctg agttcctgga gcagcaactc accagcgaca tcatcatgtc agacagctac 1320
ctccgtcctg ccgcctcacc ccggctggag tcatga 1356 

<210> 84 

<211> 451 

<212> PRT 

<213> Homo sapiens 

<400> 84 

Met Glu Ser Ser Pro Ile Pro Gln Ser Ser Gly Asn Ser Ser Thr Leu
1 5 10 15

Gly Arg Val Pro Gln Thr Pro Gly Pro Ser Thr Ala Ser Gly Val Pro
20 25 30

Glu Val Gly Leu Arg Asp Val Ala Ser Glu Ser Val Ala Leu Phe Phe
35 40 45

Met Leu Leu Leu Asp Leu Thr Ala Val Ala Gly Asn Ala Ala Val Met
50 55 60

Ala Val Ile Ala Lys Thr Pro Ala Leu Arg Lys Phe Val Phe Val Phe
65 70 75 80

His Leu Cys Leu Val Asp Leu Leu Ala Ala Leu Thr Leu Met Pro Leu
85 90 95

Ala Met Leu Ser Ser Ser Ala Leu Phe Asp His Ala Leu Phe Gly Glu
100 105 110

Val Ala Cys Arg Leu Tyr Leu Phe Leu Ser Val Cys Phe Val Ser Leu
115 120 125

Ala Ile Leu Ser Val Ser Ala Ile Asn Val Glu Arg Tyr Tyr Tyr Val
130 135 140

Val His Pro Met Arg Tyr Glu Val Arg Met Thr Leu Gly Leu Val Ala
145 150 155 160

Ser Val Leu Val Gly Val Trp Val Lys Ala Leu Ala Met Ala Ser Val
165 170 175

Pro Val Leu Gly Arg Val Ser Trp Glu Glu Gly Ala Pro Ser Val Pro
180 185 190

Pro Gly Cys Ser Leu Gln Trp Ser His Ser Ala Tyr Cys Gln Leu Phe
195 200 205

Val Val Val Phe Ala Val Leu Tyr Phe Leu Leu Pro Leu Leu Leu Ile
210 215 220

Leu Val Val Tyr Cys Ser Met Phe Arg Val Ala Arg Val Ala Ala Met
225 230 235 240

Gln His Gly Pro Leu Pro Thr Trp Met Glu Thr Pro Arg Gln Arg Ser
245 250 255

Glu Ser Leu Ser Ser Arg Ser Thr Met Val Thr Ser Ser Gly Ala Pro
260 265 270

Gln Thr Thr Pro His Arg Thr Phe Gly Gly Gly Lys Ala Lys Val Val
275 280 285

Leu Leu Ala Val Gly Gly Gln Phe Leu Leu Cys Trp Leu Pro Tyr Phe
290 295 300

Ser Phe His Leu Tyr Val Ala Leu Ser Ala Gln Pro Ile Ser Thr Gly
305 310 315 320

Gln Val Glu Ser Val Val Thr Trp Ile Gly Tyr Phe Cys Phe Thr Ser
325 330 335

Asn Pro Phe Phe Tyr Gly Cys Leu Asn Arg Gln Ile Arg Gly Glu Leu
340 345 350

Ser Lys Gln Phe Val Cys Phe Phe Lys Pro Ala Pro Glu Glu Glu Leu
355 360 365

Arg Leu Pro Ser Arg Glu Gly Ser Ile Glu Glu Asn Phe Leu Gln Phe
370 375 380

Leu Gln Gly Thr Gly Cys Pro Ser Glu Ser Trp Val Ser Arg Pro Leu
385 390 395 400

Pro Ser Pro Lys Gln Glu Pro Pro Ala Val Asp Phe Arg Ile Pro Gly
405 410 415

Gln Ile Ala Glu Glu Thr Ser Glu Phe Leu Glu Gln Gln Leu Thr Ser
420 425 430

Asp Ile Ile Met Ser Asp Ser Tyr Leu Arg Pro Ala Ala Ser Pro Arg
435 440 445

Leu Glu Ser
450

<210> 85 

<211> 28 

<212> DNA 

<213> Homo sapiens 

<400> 85 

caggaaggca aagaccacca tcatcatc 28 

<210> 86 

<211> 28 

<212> DNA 

<213> Homo sapiens 

<400> 86 

gatgatgatg gtggtctttg ccttcctg 28 

<210> 87 

<211> 1041 

<212> DNA 

<213> Homo sapiens 

<400> 87 

atggagagaa aatttatgtc cttgcaacca tccatctccg tatcagaaat ggaaccaaat 60 

ggcaccttca gcaataacaa cagcaggaac tgcacaattg aaaacttcaa gagagaattt 120 

ttcccaattg tatatctgat aatatttttc tggggagtct tgggaaatgg gttgtccata 180 

tatgttttcc tgcagcctta taagaagtcc acatctgtga acgttttcat gctaaatctg 240

gccatttcag atctcctgtt cataagcacg cttcccttca gggctgacta ttatcttaga 300

ggctccaatt ggatatttgg agacctggcc tgcaggatta tgtcttattc cttgtatgtc 360

aacatgtaca gcagtattta tttcctgacc gtgctgagtg ttgtgcgttt cctggcaatg 420

gttcacccct ttcggcttct gcatgtcacc agcatcagga gtgcctggat cctctgtggg 480

atcatatgga tccttatcat ggcttcctca ataatgctcc tggacagtgg ctctgagcag 540

aacggcagtg tcacatcatg cttagagctg aatctctata aaattgctaa gctgcagacc 600

atgaactata ttgccttggt ggtgggctgc ctgctgccat ttttcacact cagcatctgt 660

tatctgctga tcattcgggt tctgttaaaa gtggaggtcc cagaatcggg gctgcgggtt 720

tctcacagga aggcaaagac caccatcatc atcaccttga tcatcttctt cttgtgtttc 780

ctgccctatc acacactgag gaccgtccac ttgacgacat ggaaagtggg tttatgcaaa 840

gacagactgc ataaagcttt ggttatcaca ctggccttgg cagcagccaa tgcctgcttc 900

aatcctctgc tctattactt tgctggggag aattttaagg acagactaaa gtctgcactc 960

agaaaaggcc atccacagaa ggcaaagaca aagtgtgttt tccctgttag tgtgtggttg 1020

agaaaggaaa caagagtata a 1041



<210> 88 

<211> 346 

<212> PRT 

<213> Homo sapiens 

<400> 88 

Met Glu Arg Lys Phe Met Ser Leu Gln Pro Ser Ile Ser Val Ser Glu
1 5 10 15

Met Glu Pro Asn Gly Thr Phe Ser Asn Asn Asn Ser Arg Asn Cys Thr
20 25 30

Ile Glu Asn Phe Lys Arg Glu Phe Phe Pro Ile Val Tyr Leu Ile Ile
35 40 45

Phe Phe Trp Gly Val Leu Gly Asn Gly Leu Ser Ile Tyr Val Phe Leu
50 55 60

Gln Pro Tyr Lys Lys Ser Thr Ser Val Asn Val Phe Met Leu Asn Leu
65 70 75 80

Ala Ile Ser Asp Leu Leu Phe Ile Ser Thr Leu Pro Phe Arg Ala Asp
85 90 95

Tyr Tyr Leu Arg Gly Ser Asn Trp Ile Phe Gly Asp Leu Ala Cys Arg
100 105 110

Ile Met Ser Tyr Ser Leu Tyr Val Asn Met Tyr Ser Ser Ile Tyr Phe
115 120 125

Leu Thr Val Leu Ser Val Val Arg Phe Leu Ala Met Val His Pro Phe
130 135 140

Arg Leu Leu His Val Thr Ser Ile Arg Ser Ala Trp Ile Leu Cys Gly
145 150 155 160

Ile Ile Trp Ile Leu Ile Met Ala Ser Ser Ile Met Leu Leu Asp Ser
165 170 175

Gly Ser Glu Gln Asn Gly Ser Val Thr Ser Cys Leu Glu Leu Asn Leu
180 185 190

Tyr Lys Ile Ala Lys Leu Gln Thr Met Asn Tyr Ile Ala Leu Val Val
195 200 205

Gly Cys Leu Leu Pro Phe Phe Thr Leu Ser Ile Cys Tyr Leu Leu Ile
210 215 220

Ile Arg Val Leu Leu Lys Val Glu Val Pro Glu Ser Gly Leu Arg Val
225 230 235 240

Ser His Arg Lys Ala Lys Thr Thr Ile Ile Ile Thr Leu Ile Ile Phe
245 250 255

Phe Leu Cys Phe Leu Pro Tyr His Thr Leu Arg Thr Val His Leu Thr
260 265 270

Thr Trp Lys Val Gly Leu Cys Lys Asp Arg Leu His Lys Ala Leu Val
275 280 285

Ile Thr Leu Ala Leu Ala Ala Ala Asn Ala Cys Phe Asn Pro Leu Leu
290 295 300

Tyr Tyr Phe Ala Gly Glu Asn Phe Lys Asp Arg Leu Lys Ser Ala Leu
305 310 315 320

Arg Lys Gly His Pro Gln Lys Ala Lys Thr Lys Cys Val Phe Pro Val
325 330 335

Ser Val Trp Leu Arg Lys Glu Thr Arg Val
340 345

<210> 89 
<211> 28 
<212> DNA 

<213> Artificial Sequence 
<220> 

<221> misc_feature
<223> Novel Sequence 

<400> 89 

ccagtgcaaa gctaagaaag tgatcttc 28 



<210> 90
<211> 28
<212> DNA
<213> Artificial Sequence
<220>
<221> misc_feature
<223> Novel Sequence

<400> 90

gaagatcact ttcttagctt tgcactgg 28


<210> 91 

<211> 1527 

<212> DNA 

<213> Homo sapiens 












<400> 91 

atgacgtcca cctgcaccaa cagcacgcgc gagagtaaca gcagccacac gtgcatgccc 60

ctctccaaaa tgcccatcag cctggcccac ggcatcatcc gctcaaccgt gctggttatc 120

ttcctcgccg cctctttcgt cggcaacata gtgctggcgc tagtgttgca gcgcaagccg 180

cagctgctgc aggtgaccaa ccgttttatc tttaacctcc tcgtcaccga cctgctgcag 240

atttcgctcg tggccccctg ggtggtggcc acctctgtgc ctctcttctg gcccctcaac 300

agccacttct gcacggccct ggttagcctc acccacctgt tcgccttcgc cagcgtcaac 360

accattgtcg tggtgtcagt ggatcgctac ttgtccatca tccaccctct ctcctacccg 420

tccaagatga cccagcgccg cggttacctg ctcctctatg gcacctggat tgtggccatc 480

ctgcagagca ctcctccact ctacggctgg ggccaggctg cctttgatga gcgcaatgct 540

ctctgctcca tgatctgggg ggccagcccc agctacacta ttctcagcgt QqtlCTtCCttC 600

atcgtcattc cactgattgt catgattgcc tgctactccg tggtgttctg tgcagcccgg 660

aggcagcatg ctctgctgta caatgtcaag agacacagct ^ y D w ^ V ^ ^3 1. W u u ^ CI 720

tgtgtggaga atgaggatga agaaggagca gagaagaagg aggagttcca ggatgagagt 780

gagtttcgcc gccagcatga aggtgaggtc aaggccaagg agggcagaat ggaagccaag 840

gacggcagcc tgaaggccaa ggaaggaagc acggggacca gtgagagtag tgtagaggcc 900

sggggcagcg aggaggtcag agagagcagc acggtggcca gcgacggcag catggagggt 960

aaggaaggca gcaccaaagt tgaggagaac agcatgaagg cagacaaggg tcgcacagag 1020

gtcaaccagt gcagcattga cttgggtgaa gatgacatgg agtttggtga agacgacatc 1080

aatttcagtg aggatgacgt cgaggcagtg aacatcccgg agagcctccc acccagtcgt 1140

cgtaacagca acagcaaccc tcctctgccc aggtgctacc agtgcaaagc taagaaagtg 1200

atcttcatca tcattttctc ctatgtgcta tccctggggc cctactgctt tttagcagtc 1260

ctggccgtgt gggtggatgt cgaaacccag gtaccccagt gggtgatcac cataatcatc 1320

tggcttttct tcctgcagtg ctgcatccac ccctatgtct atggctacat gcacaagacc 1380

attaagaagg aaatccagga catgctgaag aagttcttct gcaaggaaaa gcccccgaaa 1440

gaagatagcc acccagacct gcccggaaca gagggtggga ctgaaggcaa gattgtccct 1500

tcctacgatt ctgctacttt tccttga 1527



<210> 92 

<211> 508 

<212> PRT 

<213> Homo sapiens 
<400> 92

Met Thr Ser Thr Cys Thr Asn Ser Thr Arg Glu Ser Asn Ser Ser His
1 5 10 15

Thr Cys Met Pro Leu Ser Lys Met Pro Ile Ser Leu Ala His Gly Ile
20 25 30

Ile Arg Ser Thr Val Leu Val Ile Phe Leu Ala Ala Ser Phe Val Gly
35 40 45

Asn Ile Val Leu Ala Leu Val Leu Gln Arg Lys Pro Gln Leu Leu Gln
50 55 60

Val Thr Asn Arg Phe Ile Phe Asn Leu Leu Val Thr Asp Leu Leu Gln
65 70 75 80

Ile Ser Leu Val Ala Pro Trp Val Val Ala Thr Ser Val Pro Leu Phe
85 90 95

Trp Pro Leu Asn Ser His Phe Cys Thr Ala Leu Val Ser Leu Thr His
100 105 110

Leu Phe Ala Phe Ala Ser Val Asn Thr Ile Val Val Val Ser Val Asp
115 120 125

Arg Tyr Leu Ser Ile Ile His Pro Leu Ser Tyr Pro Ser Lys Met Thr
130 135 140

Gln Arg Arg Gly Tyr Leu Leu Leu Tyr Gly Thr Trp Ile Val Ala Ile
145 150 155 160

Leu Gln Ser Thr Pro Pro Leu Tyr Gly Trp Gly Gln Ala Ala Phe Asp
165 170 175

Glu Arg Asn Ala Leu Cys Ser Met Ile Trp Gly Ala Ser Pro Ser Tyr
180 185 190

Thr Ile Leu Ser Val Val Ser Phe Ile Val Ile Pro Leu Ile Val Met
195 200 205

Ile Ala Cys Tyr Ser Val Val Phe Cys Ala Ala Arg Arg Gln His Ala
210 215 220

Leu Leu Tyr Asn Val Lys Arg His Ser Leu Glu Val Arg Val Lys Asp
225 230 235 240

Cys Val Glu Asn Glu Asp Glu Glu Gly Ala Glu Lys Lys Glu Glu Phe
245 250 255

Gln Asp Glu Ser Glu Phe Arg Arg Gln His Glu Gly Glu Val Lys Ala
260 265 270

Lys Glu Gly Arg Met Glu Ala Lys Asp Gly Ser Leu Lys Ala Lys Glu
275 280 285

Gly Ser Thr Gly Thr Ser Glu Ser Ser Val Glu Ala Arg Gly Ser Glu
290 295 300

Glu Val Arg Glu Ser Ser Thr Val Ala Ser Asp Gly Ser Met Glu Gly
305 310 315 320

Lys Glu Gly Ser Thr Lys Val Glu Glu Asn Ser Met Lys Ala Asp Lys
325 330 335

Gly Arg Thr Glu Val Asn Gln Cys Ser Ile Asp Leu Gly Glu Asp Asp
340 345 350

Met Glu Phe Gly Glu Asp Asp Ile Asn Phe Ser Glu Asp Asp Val Glu
355 360 365

Ala Val Asn Ile Pro Glu Ser Leu Pro Pro Ser Arg Arg Asn Ser Asn
370 375 380

Ser Asn Pro Pro Leu Pro Arg Cys Tyr Gln Cys Lys Ala Lys Lys Val
385 390 395 400

Ile Phe Ile Ile Ile Phe Ser Tyr Val Leu Ser Leu Gly Pro Tyr Cys
405 410 415

Phe Leu Ala Val Leu Ala Val Trp Val Asp Val Glu Thr Gln Val Pro
420 425 430

Gln Trp Val Ile Thr Ile Ile Ile Trp Leu Phe Phe Leu Gln Cys Cys
435 440 445

Ile His Pro Tyr Val Tyr Gly Tyr Met His Lys Thr Ile Lys Lys Glu
450 455 460

Ile Gln Asp Met Leu Lys Lys Phe Phe Cys Lys Glu Lys Pro Pro Lys
465 470 475 480

Glu Asp Ser His Pro Asp Leu Pro Gly Thr Glu Gly Gly Thr Glu Gly
485 490 495

Lys Ile Val Pro Ser Tyr Asp Ser Ala Thr Phe Pro
500 505



<210> 93 

<211> 29 

<212> DNA 

<213> Artificial Sequence
<220>

<221> misc_feature

<223> Novel Sequence

<400> 93

gccgccaccg cgccaagagg aagattggc 29

<210> 94

<211> 29

<212> DNA

<213> Artificial Sequence
<220>

<221> misc_feature

<223> Novel Sequence

<400> 94

gccaatcttc ctcttggcgc ggtggcggc 29



<210> 95
<211> 1092
<212> DNA
<213> Homo sapiens

<400> 95

atgggccccg gcgaggcgct gctggcgggt ctcctggtga tggtactggc cgtggcgctg 60

ctatccaacg cactggtgct gctttgttgc gcctacagcg ctgagctccg cactcgagcc 120

tcaggcgtcc tcctggtgaa tctgtcgctg ggccacctgc tgctggcggc gctggacatg 180

cccttcacgc tgctcggtgt gatgcgcggg cggacaccgt cggcgcccgg cgcatgccaa 240

gtcattggct tcctggacac cttcctggcg tccaacgcgg cgctgagcgt ggcggcgctg 300

agcgcagacc agtggctggc agtgggcttc ccactgcgct acgccggacg cctgcgaccg 360

cgctatgccg gcctgctgct gggctgtgcc tggggacagt cgctggcctt ctcaggcgct 420

gcacttggct gctcgtggct tggctacagc agcgccttcg cgtcctgttc gctgcgcctg 480

ccgcccgagc ctgagcgtcc gcgcttcgca gccttcaccg ccacgctcca tgccgtgggc 540

ttcgtgctgc cgctggcggt gctctgcctc acctcgctcc aggtgcaccg ggtggcacgc 600

agccactgcc agcgcatgga caccgtcacc atgaaggcgc tcgcgctgct cgccgacctg 660

caccccagtg tgcggcagcg ctgcctcatc cagcagaagc ggcgccgcca ccgcgccacc 720

aggaagattg gcattgctat tgcgaccttc ctcatctgct ttgccccgta tgtcatgacc 780

aggctggcgg agctcgtgcc cttcgtcacc gtgaacgccc agaagggcat cctcagcaag 840

tgcctgacct acagcaaggc ggtggccgac ccgttcacgt actctctgct ccgccggccg 900

ttccgccaag tcctggccgg catggtgcac cggctgctga agagaacccc gcgcccagca 960

tccacccatg acagctctct ggatgtggcc ggcatggtgc accagctgct gaagagaacc 1020

ccgcgcccag cgtccaccca caacggctct gtggacacag agaatgattc ctgcctgcag 1080

cagacacact ga 1092



<210> 96 

<211> 363 

<212> PRT 

<213> Homo sapiens 

<400> 96 

Met Gly Pro Gly Glu Ala Leu Leu Ala Gly Leu Leu Val Met Val Leu 
1 5 10 15

Ala Val Ala Leu Leu Ser Asn Ala Leu Val Leu Leu Cys Cys Ala Tyr 
20 25 30 

Ser Ala Glu Leu Arg Thr Arg Ala Ser Gly Val Leu Leu Val Asn Leu 
35 40 45 

Ser Leu Gly His Leu Leu Leu Ala Ala Leu Asp Met Pro Phe Thr Leu 
50 55 60 

Leu Gly Val Met Arg Gly Arg Thr Pro Ser Ala Pro Gly Ala Cys Gln
65 70 75 80

Val Ile Gly Phe Leu Asp Thr Phe Leu Ala Ser Asn Ala Ala Leu Ser
85 90 95

Val Ala Ala Leu Ser Ala Asp Gln Trp Leu Ala Val Gly Phe Pro Leu
100 105 110 

Arg Tyr Ala Gly Arg Leu Arg Pro Arg Tyr Ala Gly Leu Leu Leu Gly 
115 120 125 

Cys Ala Trp Gly Gln Ser Leu Ala Phe Ser Gly Ala Ala Leu Gly Cys
130 135 140 

Ser Trp Leu Gly Tyr Ser Ser Ala Phe Ala Ser Cys Ser Leu Arg Leu 
145 150 155 160 

Pro Pro Glu Pro Glu Arg Pro Arg Phe Ala Ala Phe Thr Ala Thr Leu 
165 170 175 

His Ala Val Gly Phe Val Leu Pro Leu Ala Val Leu Cys Leu Thr Ser 
180 185 190 

Leu Gln Val His Arg Val Ala Arg Ser His Cys Gln Arg Met Asp Thr
195 200 205

Val Thr Met Lys Ala Leu Ala Leu Leu Ala Asp Leu His Pro Ser Val
210 215 220

Arg Gln Arg Cys Leu Ile Gln Gln Lys Arg Arg Arg His Arg Ala Thr
225 230 235 240

Arg Lys Ile Gly Ile Ala Ile Ala Thr Phe Leu Ile Cys Phe Ala Pro
245 250 255 

Tyr Val Met Thr Arg Leu Ala Glu Leu Val Pro Phe Val Thr Val Asn 
260 265 270 

Ala Gln Lys Gly Ile Leu Ser Lys Cys Leu Thr Tyr Ser Lys Ala Val
275 280 285

Ala Asp Pro Phe Thr Tyr Ser Leu Leu Arg Arg Pro Phe Arg Gln Val
290 295 300 

Leu Ala Gly Met Val His Arg Leu Leu Lys Arg Thr Pro Arg Pro Ala 
305 310 315 320 

Ser Thr His Asp Ser Ser Leu Asp Val Ala Gly Met Val His Gln Leu
325 330 335 

Leu Lys Arg Thr Pro Arg Pro Ala Ser Thr His Asn Gly Ser Val Asp 
340 345 350 

Thr Glu Asn Asp Ser Cys Leu Gln Gln Thr His
355 360 



<210> 97
<211> 34
<212> DNA

<213> Artificial Sequence
<220>

<221> misc_feature
<223> Novel Sequence

<400> 97

gatctctaga atggagtcct cacccatccc ccag 34

<210> 98
<211> 36 
<212> DNA 

<213> Artificial Sequence 
<220> 

<221> misc_feature 
<223> Novel Sequence 

<400> 98 

gatcgatatc cgtgactcca gccggggtga ggcggc 36

<210> 99 
<211> 2610 
<212> DNA 

<213> Homo sapiens and Rat 
<400> 99 

atggagtcct cacccatccc ccagtcatca gggaactctt ccactttggg gagggtccct 60 

caaaccccag a-ccctct.ic tgccagtggg gtcccggagg tggggctacg ggatgttgct 120 

tcggaatctc t r;rrctcit ctiratgctc ctgctggact tgactgctgt ggctggcaat 180 

gccgctgtga t - ^^ t ^3-* t c.:;cc^agacg cctgccctcc gaaaatttgt cttcgtcttc 240 

cacctctgcc i 7 .;t : ; jcct rj -tqcjctgcc ctgaccctca tgcccctggc catgctctcc 300 

agctctgccc ret-, t.7jcc^ cqccctcttt ggggaggtgg cctgccgcct ctacttgttt 360 

ctgagcgtgt qcz z : q*cc%.j cctgqccatc ctctcggtgt cagccatcaa tgtggagcgc 420 

tactattacg tagtccaccc catgcgctac gaggtgcgca tgacgctggg gctggtggcc 480

tctgtgctgg tgcgtgtgtg ggtgaaggcc ttggccatgg cttctgtgcc agtgttggga 540

agggtctcct ggqagga^sqg agctcccagt gtccccccag gctgttcact ccagtggagc 600 

cacagtgcct actgccaocL crttgtggtg gtctttgctg tcctttactt tctgttgccc 660 

ctgctcctca tacttgtggt ctactgcagc atgttccgag tggcccgcgt ggctgccatg 720

cagcacgggc cgctgcccac gtggatggag acaccccggc aacgctccga atctctcagc 780 

agccgctcca cgatggtcac cagctcgggg gccccccaga ccaccccaca ccggacgttt 840

gggggaggga aagcagcagt ggttctcctg gctgtggggg gacagttcct gctctgttgg 900

ttgccctact tctctttcca cctctatgtt gccctgagtg ctcagcccat ttcaactggg 960 

caggtggaga gtgtggtcac ctggattggc tacttttgct tcacttccaa ccctttcttc 1020 

tatggatgtc tcaaccggca gatccggggg gagctcagca agcagtttgt ctgcttcttc 1080

aagccagctc cagaggagga gctgaggctg cctagccggg agggctccat tgaggagaac 1140 

ttcctgcagt tccttcaggg gactggctgt ccttctgagt cctgggtttc ccgaccccta 1200 

cccagcccca agcaggagcc acctgctgtt gactttcgaa tcccaggcca gatagctgag 1260 

gagacctctg agttcctgga gcagcaactc accagcgaca tcatcatgtc agacagctac 1320 
ctccgtcctg ccgcctcacc ccggctggag tcagcgatat ctgcagaatt ccaccacact 1380

ggactagtgg atccgagctc ggtaccaagc ttgggctgca ggtcgatggg ctgcctcggc 1440

aacagtaaga ccgaggacca gcgcaacgag gagaaggcgc agcgcgaggc caacaaaaag 1500

atcgagaagc agctgcagaa ggacaagcag gtctaccggg ccacgcaccg cctgctgctg 1560

ctgggtgctg gagagtctgg caaaagcacc attgtgaagc agatgaggat cctacatgtt 1620

aatgggttta acggagaggg cggcgaagag gacccgcagg ctgcaaggag caacagcgat 1680

ggtgagaagg ccaccaaagt gcaggacatc aaaaacaacc tgaaggaggc cattgaaacc 1740

attgtggccg ccatgagcaa cctggtgccc cccgtggagc tggccaaccc tgagaaccag 1800

ttcagagtgg actacattct gagcgtgatg aacgtgccaa actttgactt cccacctgaa 1860

ttctatgagc atgccaaggc tctgtgggag gatgagggag ttcgtgcctg ctacgagcgc 1920

tccaacgagt accagctgat cgactgtgcc cagtacttcc tggacaagat tgatgtgatc 1980

aagcaggccg actacgtgcc aagtgaccag gacctgcttc gctgccgcgt cctgacctct 2040

ggaatctttg agaccaagtt ccaggtggac aaagtcaact tccacatgtt cgatgtgggc 2100

ggccagcgcg atgaacgccg caagtggatc cagtgcttca atgatgtgac tgccatcatc 2160

ttcgtggtgg ccagcagcag ctacaacatg gtcatccggg aggacaacca gaccaaccgt 2220

ctgcaggagg ctctgaacct cttcaagagc atctggaaca acagatggct gcgtaccatc 2280

tctgtgatcc tcttcctcaa caagcaagat ctgcttgctg agaaggtcct cgctgggaaa 2340

tcgaagattg aggactactt tccagagttc gctcgctaca ccactcctga ggatgcgact 2400

cccgagcccg gagaggaccc acgcgtgacc cgggccaagt acttcatccg ggatgagttt 2460

ctgagaatca gcactgctag tggagatgga cgtcactact gctaccctca ctttacctgc 2520

gccgtggaca ctgagaacat ccgccgtgtc ttcaacgact gccgtgacat catccagcgc 2580

atgcatcttc gccaatacga gctgctctaa 2610



<210> 100 
<211> 869 
<212> PRT 

<213> Homo sapiens and Rat 
<400> 100 

Met Glu Ser Ser Pro Ile Pro Gln Ser Ser Gly Asn Ser Ser Thr Leu
1 5 10 15

Gly Arg Val Pro Gln Thr Pro Gly Pro Ser Thr Ala Ser Gly Val Pro
20 25 30

Glu Val Gly Leu Arg Asp Val Ala Ser Glu Ser Val Ala Leu Phe Phe
35 40 45

Met Leu Leu Leu Asp Leu Thr Ala Val Ala Gly Asn Ala Ala Val Met
50 55 60

Ala Val Ile Ala Lys Thr Pro Ala Leu Arg Lys Phe Val Phe Val Phe
65 70 75 80

His Leu Cys Leu Val Asp Leu Leu Ala Ala Leu Thr Leu Met Pro Leu
85 90 95

Ala Met Leu Ser Ser Ser Ala Leu Phe Asp His Ala Leu Phe Gly Glu
100 105 110

Val Ala Cys Arg Leu Tyr Leu Phe Leu Ser Val Cys Phe Val Ser Leu
115 120 125

Ala Ile Leu Ser Val Ser Ala Ile Asn Val Glu Arg Tyr Tyr Tyr Val
130 135 140

Val His Pro Met Arg Tyr Glu Val Arg Met Thr Leu Gly Leu Val Ala
145 150 155 160

Ser Val Leu Val Gly Val Trp Val Lys Ala Leu Ala Met Ala Ser Val
165 170 175

Pro Val Leu Gly Arg Val Ser Trp Glu Glu Gly Ala Pro Ser Val Pro
180 185 190

Pro Gly Cys Ser Leu Gln Trp Ser His Ser Ala Tyr Cys Gln Leu Phe
195 200 205

Val Val Val Phe Ala Val Leu Tyr Phe Leu Leu Pro Leu Leu Leu Ile
210 215 220

Leu Val Val Tyr Cys Ser Met Phe Arg Val Ala Arg Val Ala Ala Met
225 230 235 240

Gln His Gly Pro Leu Pro Thr Trp Met Glu Thr Pro Arg Gln Arg Ser
245 250 255

Glu Ser Leu Ser Ser Arg Ser Thr Met Val Thr Ser Ser Gly Ala Pro
260 265 270

Gln Thr Thr Pro His Arg Thr Phe Gly Gly Gly Lys Ala Ala Val Val
275 280 285

Leu Leu Ala Val Gly Gly Gln Phe Leu Leu Cys Trp Leu Pro Tyr Phe
290 295 300

Ser Phe His Leu Tyr Val Ala Leu Ser Ala Gln Pro Ile Ser Thr Gly
305 310 315 320

Gln Val Glu Ser Val Val Thr Trp Ile Gly Tyr Phe Cys Phe Thr Ser
325 330 335

Asn Pro Phe Phe Tyr Gly Cys Leu Asn Arg Gln Ile Arg Gly Glu Leu
340 345 350

Ser Lys Gln Phe Val Cys Phe Phe Lys Pro Ala Pro Glu Glu Glu Leu
355 360 365

Arg Leu Pro Ser Arg Glu Gly Ser Ile Glu Glu Asn Phe Leu Gln Phe
370 375 380

Leu Gln Gly Thr Gly Cys Pro Ser Glu Ser Trp Val Ser Arg Pro Leu
385 390 395 400

Pro Ser Pro Lys Gln Glu Pro Pro Ala Val Asp Phe Arg Ile Pro Gly
405 410 415

Gln Ile Ala Glu Glu Thr Ser Glu Phe Leu Glu Gln Gln Leu Thr Ser
420 425 430

Asp Ile Ile Met Ser Asp Ser Tyr Leu Arg Pro Ala Ala Ser Pro Arg
435 440 445

Leu Glu Ser Ala Ile Ser Ala Glu Phe His His Thr Gly Leu Val Asp
450 455 460

Pro Ser Ser Val Pro Ser Leu Gly Cys Arg Ser Met Gly Cys Leu Gly
465 470 475 480

Asn Ser Lys Thr Glu Asp Gln Arg Asn Glu Glu Lys Ala Gln Arg Glu
485 490 495

Ala Asn Lys Lys Ile Glu Lys Gln Leu Gln Lys Asp Lys Gln Val Tyr
500 505 510

Arg Ala Thr His Arg Leu Leu Leu Leu Gly Ala Gly Glu Ser Gly Lys
515 520 525

Ser Thr Ile Val Lys Gln Met Arg Ile Leu His Val Asn Gly Phe Asn
530 535 540

Gly Glu Gly Gly Glu Glu Asp Pro Gln Ala Ala Arg Ser Asn Ser Asp
545 550 555 560

Gly Glu Lys Ala Thr Lys Val Gln Asp Ile Lys Asn Asn Leu Lys Glu
565 570 575

Ala Ile Glu Thr Ile Val Ala Ala Met Ser Asn Leu Val Pro Pro Val
580 585 590

Glu Leu Ala Asn Pro Glu Asn Gln Phe Arg Val Asp Tyr Ile Leu Ser
595 600 605

Val Met Asn Val Pro Asn Phe Asp Phe Pro Pro Glu Phe Tyr Glu His
610 615 620

Ala Lys Ala Leu Trp Glu Asp Glu Gly Val Arg Ala Cys Tyr Glu Arg
625 630 635 640

Ser Asn Glu Tyr Gln Leu Ile Asp Cys Ala Gln Tyr Phe Leu Asp Lys
645 650 655

Ile Asp Val Ile Lys Gln Ala Asp Tyr Val Pro Ser Asp Gln Asp Leu
660 665 670

Leu Arg Cys Arg Val Leu Thr Ser Gly Ile Phe Glu Thr Lys Phe Gln
675 680 685

Val Asp Lys Val Asn Phe His Met Phe Asp Val Gly Gly Gln Arg Asp
690 695 700

Glu Arg Arg Lys Trp Ile Gln Cys Phe Asn Asp Val Thr Ala Ile Ile
705 710 715 720

Phe Val Val Ala Ser Ser Ser Tyr Asn Met Val Ile Arg Glu Asp Asn
725 730 735

Gln Thr Asn Arg Leu Gln Glu Ala Leu Asn Leu Phe Lys Ser Ile Trp
740 745 750

Asn Asn Arg Trp Leu Arg Thr Ile Ser Val Ile Leu Phe Leu Asn Lys
755 760 765

Gln Asp Leu Leu Ala Glu Lys Val Leu Ala Gly Lys Ser Lys Ile Glu
770 775 780

Asp Tyr Phe Pro Glu Phe Ala Arg Tyr Thr Thr Pro Glu Asp Ala Thr
785 790 795 800

Pro Glu Pro Gly Glu Asp Pro Arg Val Thr Arg Ala Lys Tyr Phe Ile
805 810 815

Arg Asp Glu Phe Leu Arg Ile Ser Thr Ala Ser Gly Asp Gly Arg His
820 825 830

Tyr Cys Tyr Pro His Phe Thr Cys Ala Val Asp Thr Glu Asn Ile Arg
835 840 845

Arg Val Phe Asn Asp Cys Arg Asp Ile Ile Gln Arg Met His Leu Arg
850 855 860

Gln Tyr Glu Leu Leu
865

<210> 101 

<211> 30 

<212> DNA 

<213> Artificial Sequence 
<220> 

<221> misc_feature

<223> Novel Sequence 

<400> 101 

tctagaatga cgtccacctg caccaacagc 30 

<210> 102 

<211> 34 

<212> DNA 

<213> Artificial Sequence 
<220> 

<221> misc_feature 

<223> Novel Sequence 

<400> 102 

gatatcgcag gaaaagtagc agaatcgtag gaag 34

<210> 103 
<211> 2781 
<212> DNA 

<213> Homo sapiens and Rat
<400> 103 

atgacgtcca cctgcaccaa cagcacgcgc gagagtaaca gcagccacac gtgcatgccc 60 
ctctccaaaa tgcccatcag cctggcccac ggcatcatcc gctcaaccgt gctggttatc 120 
ttcctcgccg cctctttcgt cggcaacata gtgctggcgc tagtgttgca gcgcaagccg 180 
cagctgctgc aggtgaccaa ccgttttatc tttaacctcc tcgtcaccga cctgctgcag 240 
atttcgctcg tggccccctg ggtggtggcc acctctgtgc ctctcttctg gcccctcaac 300

agccacttct gcacggccct ggttagcctc acccacctgt tcgccttcgc cagcgtcaac 360 

accattgtcg tggtgtcagt ggatcgctac ttgtccatca tccaccctct ctcctacccg 420 

tccaagatga cccagcgccg cggttacctg ctcctctatg gcacctggat tgtggccatc 480 

ctgcagagca ctcctccact ctacggctgg ggccaggctg cctttgatga gcgcaatgct 540 

ctctgctcca tgatctgggg ggccagcccc agctacacta ttctcagcgt ggtgtccttc 600 

atcgtcattc cactgattgt catgattgcc tgctactccg tggtgttctg tgcagcccgg 660 

aggcagcatg ctctgctgta caatgtcaag agacacagct tggaagtgcg agtcaaggac 720 

tgtgtggaga atgaggatga agagggagca gagaagaagg aggagttcca ggatgagagt 780 

gagtttcgcc gccagcatga aggtgaggtc aaggccaagg agggcagaat ggaagccaag 840 

gacggcagcc tgaaggccaa ggaaggaagc acggggacca gtgagagtag tgtagaggcc 900 

aggggcagcg aggaggtcag agagagcagc acggtggcca gcgacggcag catggagggt 960 

aaggaaggca gcaccaaagt tgaggagaac agcatgaagg cagacaaggg tcgcacagag 1020 

gtcaaccagt gcagcattga cttgggtgaa gatgacatgg agtttggtga agacgacatc 1080 

aatttcagtg aggatgacgt cgaggcagtg aacatcccgg agagcctccc acccagtcgt 1140 

cgtaacagca acagcaaccc tcctctgccc aggtgctacc agtgcaaagc tgctaaagtg 1200 

atcttcatca tcattttctc ctatgtgcta tccctggggc cctactgctt tttagcagtc 1260 

ctggccgtgt gggtggatgt cgaaacccag gtaccccagt gggtgatcac cataatcatc 1320 

tggcttttct tcctgcagtg ctgcatccac ccctatgtct atggctacat gcacaagacc 1380 

attaagaagg aaatccagga catgctgaag aagttcttct gcaaggaaaa gcccccgaaa 1440

gaagatagcc acccagacct gcccggaaca gagggtggga ctgaaggcaa gattgtccct 1500 

tcctacgatt ctgctacttt tcctgcgata tctgcagaat tccaccacac tggactagtg 1560 

gatccgagct cggtaccaag cttgggctgc aggtcgatgg gctgcctcgg caacagtaag 1620 

accgaggacc agcgcaacga ggagaaggcg cagcgcgagg ccaacaaaaa gatcgagaag 1680 

cagctgcaga aggacaagca ggtctaccgg gccacgcacc gcctgctgct gctgggtgct 1740 

ggagagtctg gcaaaagcac cattgtgaag cagatgagga tcctacatgt taatgggttt 1800 

aacggagagg gcggcgaaga ggacccgcag gctgcaagga gcaacagcga tggtgagaag 1860 

gccaccaaag tgcaggacat caaaaacaac ctgaaggagg ccattgaaac cattgtggcc 1920 

gccatgagca acctggtgcc ccccgtggag ctggccaacc ctgagaacca gttcagagtg 1980 

gactacattc tgagcgtgat gaacgtgcca aactttgact tcccacctga attctatgag 2040 

catgccaagg ctctgtggga ggatgaggga gttcgtgcct gctacgagcg ctccaacgag 2100 

taccagctga tcgactgtgc ccagtacttc ctggacaaga ttgatgtgat caagcaggcc 2160 

gactacgtgc caagtgacca ggacctgctt cgctgccgcg tcctgacctc tggaatcttt 2220 
gagaccaagt tccaggtgga caaagtcaac ttccacatgt tcgatgtggg cggccagcgc 2280

gatgaacgcc gcaagtggat ccagtgcttc aatgatgtga ctgccatcat cttcgtggtg 2340

gccagcagca gctacaacat ggtcatccgg gaggacaacc agaccaaccg tctgcaggag 2400

gctctgaacc tcttcaagag catctggaac aacagatggc tgcgtaccat ctctgtgatc 2460

ctcttcctca acaagcaaga tctgcttgct gagaaggtcc tcgctgggaa atcgaagatt 2520

gaggactact ttccagagtt cgctcgctac accactcctg aggatgcgac tcccgagccc 2580

ggagaggacc cacgcgtgac ccgggccaag tacttcatcc gggatgagtt tctgagaatc 2640

agcactgcta gtggagatgg acgtcactac tgctaccctc actttacctg cgccgtggac 2700

actgagaaca tccgccgtgt cttcaacgac tgccgtgaca tcatccagcg catgcatctt 2760

cgccaatacg agctgctcta a 2781



<210> 104
<211> 926
<212> PRT
<213> Homo sapiens and Rat

<400> 104

Met Thr Ser Thr Cys Thr Asn Ser Thr Arg Glu Ser Asn Ser Ser His
1 5 10 15

Thr Cys Met Pro Leu Ser Lys Met Pro Ile Ser Leu Ala His Gly Ile
20 25 30

Ile Arg Ser Thr Val Leu Val Ile Phe Leu Ala Ala Ser Phe Val Gly
35 40 45

Asn Ile Val Leu Ala Leu Val Leu Gln Arg Lys Pro Gln Leu Leu Gln
50 55 60

Val Thr Asn Arg Phe Ile Phe Asn Leu Leu Val Thr Asp Leu Leu Gln
65 70 75 80

Ile Ser Leu Val Ala Pro Trp Val Val Ala Thr Ser Val Pro Leu Phe
85 90 95

Trp Pro Leu Asn Ser His Phe Cys Thr Ala Leu Val Ser Leu Thr His
100 105 110

Leu Phe Ala Phe Ala Ser Val Asn Thr Ile Val Val Val Ser Val Asp
115 120 125

Arg Tyr Leu Ser Ile Ile His Pro Leu Ser Tyr Pro Ser Lys Met Thr
130 135 140

Gln Arg Arg Gly Tyr Leu Leu Leu Tyr Gly Thr Trp Ile Val Ala Ile
145 150 155 160

Leu Gln Ser Thr Pro Pro Leu Tyr Gly Trp Gly Gln Ala Ala Phe Asp
165 170 175

Glu Arg Asn Ala Leu Cys Ser Met Ile Trp Gly Ala Ser Pro Ser Tyr
180 185 190

Thr Ile Leu Ser Val Val Ser Phe Ile Val Ile Pro Leu Ile Val Met
195 200 205

Ile Ala Cys Tyr Ser Val Val Phe Cys Ala Ala Arg Arg Gln His Ala
210 215 220

Leu Leu Tyr Asn Val Lys Arg His Ser Leu Glu Val Arg Val Lys Asp
225 230 235 240

Cys Val Glu Asn Glu Asp Glu Glu Gly Ala Glu Lys Lys Glu Glu Phe
245 250 255

Gln Asp Glu Ser Glu Phe Arg Arg Gln His Glu Gly Glu Val Lys Ala
260 265 270

Lys Glu Gly Arg Met Glu Ala Lys Asp Gly Ser Leu Lys Ala Lys Glu
275 280 285

Gly Ser Thr Gly Thr Ser Glu Ser Ser Val Glu Ala Arg Gly Ser Glu
290 295 300

Glu Val Arg Glu Ser Ser Thr Val Ala Ser Asp Gly Ser Met Glu Gly
305 310 315 320

Lys Glu Gly Ser Thr Lys Val Glu Glu Asn Ser Met Lys Ala Asp Lys
325 330 335

Gly Arg Thr Glu Val Asn Gln Cys Ser Ile Asp Leu Gly Glu Asp Asp
340 345 350

Met Glu Phe Gly Glu Asp Asp Ile Asn Phe Ser Glu Asp Asp Val Glu
355 360 365

Ala Val Asn Ile Pro Glu Ser Leu Pro Pro Ser Arg Arg Asn Ser Asn
370 375 380

Ser Asn Pro Pro Leu Pro Arg Cys Tyr Gln Cys Lys Ala Ala Lys Val
385 390 395 400

Ile Phe Ile Ile Ile Phe Ser Tyr Val Leu Ser Leu Gly Pro Tyr Cys
405 410 415

Phe Leu Ala Val Leu Ala Val Trp Val Asp Val Glu Thr Gln Val Pro
420 425 430

Gln Trp Val Ile Thr Ile Ile Ile Trp Leu Phe Phe Leu Gln Cys Cys
435 440 445

Ile His Pro Tyr Val Tyr Gly Tyr Met His Lys Thr Ile Lys Lys Glu
450 455 460

Ile Gln Asp Met Leu Lys Lys Phe Phe Cys Lys Glu Lys Pro Pro Lys
465 470 475 480

Glu Asp Ser His Pro Asp Leu Pro Gly Thr Glu Gly Gly Thr Glu Gly
485 490 495

Lys Ile Val Pro Ser Tyr Asp Ser Ala Thr Phe Pro Ala Ile Ser Ala
500 505 510

Glu Phe His His Thr Gly Leu Val Asp Pro Ser Ser Val Pro Ser Leu
515 520 525

Gly Cys Arg Ser Met Gly Cys Leu Gly Asn Ser Lys Thr Glu Asp Gln
530 535 540




Arg Asn Glu Glu Lys Ala Gln Arg Glu Ala Asn Lys Lys Ile Glu Lys
545 550 555 560

Gln Leu Gln Lys Asp Lys Gln Val Tyr Arg Ala Thr His Arg Leu Leu
565 570 575

Leu Leu Gly Ala Gly Glu Ser Gly Lys Ser Thr Ile Val Lys Gln Met
580 585 590

Arg Ile Leu His Val Asn Gly Phe Asn Gly Glu Gly Gly Glu Glu Asp
595 600 605

Pro Gln Ala Ala Arg Ser Asn Ser Asp Gly Glu Lys Ala Thr Lys Val
610 615 620

Gln Asp Ile Lys Asn Asn Leu Lys Glu Ala Ile Glu Thr Ile Val Ala
625 630 635 640

Ala Met Ser Asn Leu Val Pro Pro Val Glu Leu Ala Asn Pro Glu Asn
645 650 655

Gln Phe Arg Val Asp Tyr Ile Leu Ser Val Met Asn Val Pro Asn Phe
660 665 670

Asp Phe Pro Pro Glu Phe Tyr Glu His Ala Lys Ala Leu Trp Glu Asp
675 680 685

Glu Gly Val Arg Ala Cys Tyr Glu Arg Ser Asn Glu Tyr Gln Leu Ile
690 695 700

Asp Cys Ala Gln Tyr Phe Leu Asp Lys Ile Asp Val Ile Lys Gln Ala
705 710 715 720

Asp Tyr Val Pro Ser Asp Gln Asp Leu Leu Arg Cys Arg Val Leu Thr
725 730 735

Ser Gly Ile Phe Glu Thr Lys Phe Gln Val Asp Lys Val Asn Phe His
740 745 750

Met Phe Asp Val Gly Gly Gln Arg Asp Glu Arg Arg Lys Trp Ile Gln
755 760 765

Cys Phe Asn Asp Val Thr Ala Ile Ile Phe Val Val Ala Ser Ser Ser
770 775 780

Tyr Asn Met Val Ile Arg Glu Asp Asn Gln Thr Asn Arg Leu Gln Glu
785 790 795 800

Ala Leu Asn Leu Phe Lys Ser Ile Trp Asn Asn Arg Trp Leu Arg Thr
805 810 815

Ile Ser Val Ile Leu Phe Leu Asn Lys Gln Asp Leu Leu Ala Glu Lys
820 825 830

Val Leu Ala Gly Lys Ser Lys Ile Glu Asp Tyr Phe Pro Glu Phe Ala
835 840 845

Arg Tyr Thr Thr Pro Glu Asp Ala Thr Pro Glu Pro Gly Glu Asp Pro
850 855 860

Arg Val Thr Arg Ala Lys Tyr Phe Ile Arg Asp Glu Phe Leu Arg Ile
865 870 875 880

Ser Thr Ala Ser Gly Asp Gly Arg His Tyr Cys Tyr Pro His Phe Thr
885 890 895

Cys Ala Val Asp Thr Glu Asn Ile Arg Arg Val Phe Asn Asp Cys Arg
900 905 910

Asp Ile Ile Gln Arg Met His Leu Arg Gln Tyr Glu Leu Leu
915 920 925



<210> 105
<211> 23
<212> DNA
<213> Artificial Sequence
<220>
<221> misc_feature
<223> Novel Sequence
<400> 105

catgtatgcc agc:;tcctgc tec 23

<210> 106
<211> 24
<212> DNA
<213> Artificial Sequence
<220>
<221> misc_feature
<223> Novel Sequence
<400> 106

gctatgcctg aagcc»»gt.c: t.gtg 24

<210> 107
<211> 25
<212> DNA
<213> Artificial Sequence
<220>
<221> misc_feature
<223> Novel Sequence
<400> 107

gcacctgctc ctgagcacct tctcc 25



<210> 108
<211> 26
<212> DNA
<213> Artificial Sequence
<220>
<221> misc_feature
<223> Novel Sequence
<400> 108

cacagcgctg cagccctgca gctggc 26

<210> 109
<211> 24

<212> DNA 

<213> Artificial Sequence 
<220> 

<221> misc_feature

<223> Novel Sequence 

<400> 109 

ccagtgatga ctctgtccag cctg 24 



<210> 110
<211> 24
<212> DNA
<213> Artificial Sequence
<220>
<221> misc_feature
<223> Novel Sequence
<400> 110

cagacacttg gcagggacga ggtg 24

<210> 111
<211> 26
<212> DNA
<213> Artificial Sequence
<220>
<221> misc_feature
<223> Novel Sequence
<400> 111

cttgtggtct actgcagcat gttccg 26



<210> 112
<211> 25
<212> DNA
<213> Artificial Sequence
<220>
<221> misc_feature
<223> Novel Sequence
<400> 112

catatccctc cgagtgtcca gcggc 25

<210> 113
<211> 24
<212> DNA
<213> Artificial Sequence
<220>
<221> misc_feature
<223> Novel Sequence
<400> 113

atggatcctt atcatggctt cctc 24

<210> 114
<211> 27
<212> DNA
<213> Artificial Sequence
<220>
<221> misc_feature
<223> Novel Sequence
<400> 114

caagaacagg rctcatctaa gagctcc 27

<210> 115
<211> 26
<212> DNA
<213> Artificial Sequence
<220>
<221> misc_feature
<223> Novel Sequence
<400> 115

ctctgatgcc atctgctgga ttcctg 26

<210> 116
<211> 26
<212> DNA
<213> Artificial Sequence
<220>
<221> misc_feature
<223> Novel Sequence
<400> 116

gtagtccact gaaagtccag tgatcc 26

<210> 117
<211> 24
<212> DNA
<213> Artificial Sequence
<220>
<221> misc_feature
<223> Novel Sequence
<400> 117

tggtggcgat ggccaacagc gctc 24

<210> 118
<211> 24
<212> DNA
<213> Artificial Sequence
<220>
<221> misc_feature
<223> Novel Sequence
<400> 118

gttgcgcctt agcgacagat gacc 24



<210> 119
<211> 23
<212> DNA
<213> Artificial Sequence
<220>
<221> misc_feature
<223> Novel Sequence
<400> 119

tcaacctgt a trtccagcatc etc 23

<210> 120
<211> 23
<212> DNA
<213> Artificial Sequence
<220>
<221> misc_feature
<223> Novel Sequence
<400> 120

aaggagtagc aqd^t agtta gcc 23



<210> 121
<211> 24
<212> DNA
<213> Artificial Sequence
<220>
<221> misc_feature
<223> Novel Sequence
<400> 121

gacacctgtc agcggtcctg tgtg 24

<210> 122
<211> 27
<212> DNA
<213> Artificial Sequence
<220>
<221> misc_feature
<223> Novel Sequence
<400> 122

ctgatggaag tagaggctgt ccatctc 27
<210> 123
<211> 24
<212> DNA
<213> Artificial Sequence
<220>
<221> misc_feature
<223> Novel Sequence
<400> 123

gcgctgagcg cagaccagtg gctg 24

<210> 124
<211> 24
<212> DNA
<213> Artificial Sequence
<220>
<221> misc_feature
<223> Novel Sequence
<400> 124

cacggtgacg aagggcacga gctc 24



<210> 125
<211> 24
<212> DNA
<213> Artificial Sequence
<220>
<221> misc_feature
<223> Novel Sequence
<400> 125

agccaccccl gccaggaagc atgg 24



<210> 126
<211> 25
<212> DNA
<213> Artificial Sequence
<220>
<221> misc_feature
<223> Novel Sequence
<400> 126

ccaggtaggt gtqcdgcaca atggc 25

<210> 127
<211> 25
<212> DNA
<213> Artificial Sequence
<220>
<221> misc_feature
<223> Novel Sequence
<400> 127

ctgttcaaca gggctggttg gcaac 25



<210> 128
<211> 25
<212> DNA
<213> Artificial Sequence
<220>
<221> misc_feature
<223> Novel Sequence
<400> 128

atcatgtcta gactcatggt gate 25

<210> 129
<211> 6
<212> PRT
<213> Artificial Sequence
<220>
<221> misc_feature
<223> Novel Sequence
<400> 129

Thr Leu Glu Ser Ile Met
1 5

<210> 130
<211> 5
<212> PRT
<213> Artificial Sequence
<220>
<221> misc_feature
<223> Novel Sequence
<400> 130

Glu Tyr Asn Leu Val
1 5

<210> 131
<211> 5
<212> PRT
<213> Artificial Sequence
<220>
<221> misc_feature
<223> Novel Sequence
<400> 131

Asp Cys Gly Leu Phe
1 5

<210> 132
<211> 36
<212> PRT
<213> Artificial Sequence
<220>
<221> misc_feature
<223> Novel Sequence
<400> 132

Gly Ala Thr Cys Ala Ala Gly Cys Thr Thr Cys Cys Ala Thr Gly Gly
1 5 10 15

Cys Gly Thr Gly Cys Thr Gly Cys Cys Thr Gly Ala Gly Cys Gly Ala
20 25 30

Gly Gly Ala Gly
35


<210> 133
<211> 53
<212> PRT
<213> Artificial Sequence
<220>
<221> misc_feature
<223> Novel Sequence
<400> 133

Gly Ala Thr Cys Gly Gly Ala Thr Cys Cys Thr Thr Ala Gly Ala Ala
1 5 10 15

Cys Ala Gly Gly Cys Cys Gly Cys Ala Gly Thr Cys Cys Thr Thr Cys
20 25 30

Ala Gly Gly Thr Thr Cys Ala Gly Cys Thr Gly Cys Ala Gly Gly Ala
35 40 45

Thr Gly Gly Thr Gly
50